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NOVEL COMPOSITIONS AND METHODS FOR CANCER 

0 «f 1 1 <; q N s 09/747 ,377, filed December 

The present application is a continuing appi.cat.on of jons 

December 20. 2001 . all of which are expressly incorporated here, by reference. 
15 FIELD OF THE INVENTION 

20 methods. 

BACKGROUND OF THE INVENTION 

genes. 

3 5 new protooncogenes. 
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or Akv, is a potent .noucer 0 . — 

carried in the germline. A number of sequences have been d n t ^ as rele 
action of lymphoma and leuKemia by analyzing Sorensen et 

al., J. Virology 70:4063 (1996); Sorensen et al.. J. Virology 67.n v 
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26.308 ,2000,; ^ U e, N*u* Gen** OM6 l« - - — - 
expressly incorporated by reference herein. 

. ~« ^oiran^rs involving the lymphatic system and are generally 

pi Jit increase in me number of Oocytes and the amount of bone marrow. w*h 
o enlargement of the spleen and lymph nodes. 

• n „. 0 fthemostsignificantdiseasesthataffectswomen. At the current rate. 
Breast cancer ,s one of the most -gn «, gg ^ 

American women have a In n* i tf«*P 9 ^ ^ ^ . Rg 

Society. 1992). Treatment of breast cancer at later siag 
* making eariy detection a high priority in medical management of the d,sease. 

particular in oncogenesis. 
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SUMMARY OF THE INVENTION 



Also provided herein are memou* , a)so pr0 vided herein, 

cell. Methods of treatment of carcinomas. .ndud,ng d.agnosis. are also p 

- ^th^ of screening drug candidates comprises providing a cell that 
In one aspect, a method of screening orug ferred emboaime nts 

: Ss=s=ss=s=s. , == r 
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the drug candidate. 
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• • -thcd f screening for a bioactive agent capable of b.nd.ng to a 
A,so provided herein ,s a method f screenmg to bio active 
CA protein (CAP), the method comprising comb,n-ng the 
ag ent and determining the binding of the candidate agent to the CAP. 

^ r«r ^.reenino for a bioactive agent capable of modulating 
Fort her provided herein is a comprises combining the CAP and a 

the «*» of a CAP. «n one J Mndidate agent on the bioacthrity 

candidate bioactive agent, and determmmg the effect of the can 
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of the CAP. 
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. . „ teamethod of evaluating the effect of a candidate carcinoma drug comprising 
Also provided is a method ot eva.ua a The 

comparing the expression profile of the panem » « ^ 

p^eraW «tec«<l tan tte graup conasWJ * •» WW" 
their complements. 

«« selected *om M group of sequences ^"^^JL-h-l 
Preferably . the method comprises contacting an agent specific for P 
protein in an amount sufficient to effect neutralization. 

CA protein, preferably selected from the sequences outlined in Tables 

*«h for diaanosing or determining the propensity to carcinomas. 
Ateo provided herein Is a method for diagnosing , o „ (ymphoma gene 

especially lymphoma or leukemia by sequencing nrm/k . e d for determining 

WodW *np»»™ and M«M 9 «* «« " a " 



The present invention 



DETAILED DESCRIPTION OF THE INVENTION 

* ,»nrj»s associated with carcinomas, 
is directed to a number of sequences associa 
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clonally mtegrare p mechanism are used to .solate 

la J nos. protooncenes or parogen* MM*. »« g-» - 

«e oroviral integration is random, rare integrants will activate host 

o new proviruses at clonal stotehiometries In tumors. 

Tne use of oncogenic retroviruses, whose sequences insert into the genome of the host 
-me use or onoog identification of host sequences involved in 

organism resulting in carcinoma. *m W include 

M caralsobelouadloomer^ofcancersaswell.ootllnedbetaw. 

25 ^-^--^^«**- h ** , **-■ ,,,, "■ n,, * 

MM. cancer, w«ch can be diasnosed or screens* to, ustn, t*e memods of thep^ 

^Lm- pentoneum. omenam end -aesenter,; other digest canoe* oftf» 
T^l^Lca* ^eer.erK.eineeesi^.Wendbconcnu^w; 
35 respiratory system (nasal cm, r™ . .,^.^- Mk)m a- bones and Joints; 

====S=Ss=.-=r 

skin cancers; Kapos.s sarcoma and ^ ^ ^ 

(cervix uteri; corpus uten; uterus, nos. ovary, vagina. 
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of the male genital system (prostate g.and; teste; penis; and other mate genital); cancers of 
the urinary system (urinary btedder, kidney and renai pelvis; ureter, and other unnary); 
cancers of the eye and orbit; cancers of the brain and nervous system (brain; and other 
nervous system); cancers of the endocrine system (thyroid gland and other endocnne. 
inciuding thymus); cancers of the tymphomas (hodgkin's disease and non-hodgk-n s 
lymphoma), muitiple myeloma, and teukemias (lymphocyte teukemia; myeloid teukem,a; 
monocytic leukemia; and other leukemias). 

Other cancers, dassrfied by histological type, that may be associated with the sequences of 
, the invention include, but are not limited to, Neoplasm, malignant; Cardrrcma NOS; 

Cardnoma. undifferentiated. NOS; Giant and spindle cel. cardnoma; Small ce Icaranoma. 
NOS- Papillary cardnoma. NOS; Squamous cell cardnoma. NOS; Lymphoep,thel,a. 
cardnoma; Basal cel. cardnoma. NOS; Pilomatrix carcinoma; Transitional cell carcnoma 
NOS; Papillary transitional cel. cardnoma; Adenocarcinoma. NOS; Gastnnoma. malrgnant 
5 Cholangiocardnoma; Hepatocellular carcinoma. NOS; Combined hepatocellular caranoma 
and cholangiocardnoma; Trabecular adenocarcinoma; Adenoid cystic carcnoma; 
Adenocarcinoma in adenomatous polyp; Adenocardnoma. familial polypos.s col,; Sol* 
cardnoma. NOS; Cardnoid tumor, malignant Branchiolo^eolar adenocardnoma; Paprltery 
adenocarcinoma. NOS; Chromophobe cardnoma; Addophi. cardnoma; Oxyph .c 
, 0 adenocarcinoma; Basophil cardnoma; Clear cel. adenocarcinoma. NOS; Granular cel. 
cardnoma; Follicular adenocardnoma. NOS; Papiltery and follicular 
Nonencapsulating sclerosing cardnoma; Adrenal cortical cardnoma; Endometrord caranoma. 
Skin appendage cardnoma; Apocrine adenocarcinoma; Sebaceous ^enocaranon^ 
Ceruminous adenocardnoma; Mucoepidermoid carcinoma; Cystadenocaranoma, NOS. 
25 Papillary cystadenocaranoma. NOS; Papillary serous cystadenocarcinoma; ^-nous 

Xnlrdnoma. NOS; Mucous adenocardnoma; Sfcnet rtng cel. caranoma; «ng 
duct carcinoma; Medullary carcinoma. NOS; Lobular carcnoma; Inflammatory caranoma. 
Pagers disease, niammary; Adnarc*lc*rdn^^^ 

Adenocardnoma w/ squamous metaplasia; Thymoma, malignant; Ovarian stroma, tumor. 

30 malfcnant Tneooma. malignant Granulosa cel. tumor, malignant; Androblastoma, mahgnant 
Sertoli cel. cardnoma; Leydig cell tumor, malignant; Lipid cell tumor, malignant 
Paraganglia, malignant Extra^mmary paraganglioma, malfcnant; Pheochromocytoma. 
Glomangiosareoma; Malignant melanoma. NOS; Amelanotic melanoma; Superfical 
tZZ —a; M,ig metenom* in giant pigmented nevus; ^ cei, melanoma 

35 ™. malignant Sarcoma. NOS. Fibrosarcoma. NOS; Fibrous his«ocytoma. n^gnant 
Myxosarcoma; Liposarcoma. NOS; Leiomyosarcoma, NOS; 

Embryonal rhabdomyosarcoma; Alveolar rhabdomyosarcoma; Stroma, sarcoma, NOS. Mrxed 
*mor. malignant. NOS; MulleHan mixed tumor. Nephroblastoma; Hepato J— * 
Carcinosarcoma. NOS; Mesenchymoma, malignant; Brenner tumor, malrgnant, Phyllodes 
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.M. ,..*-* ^ — tHI*—— • "*-°^*™" 
Embryonal earcinoma. NOS; Teratoma. HO* ««™ — • mah3 " a * 

LLam Mesooephroma. Hemangiosereoma; Hen«ng«nd~ 

matfcnanl; Kaposreearcoma; Hemangiopericytoma, maligna* Lymphosarcoma. 
Osfcoearcoma. NOS; JuxBcortk* osteosarcoma; Cbor«*>ea™ma. NO* 
Choada.Mas.oma. malignant Masanoh^a. chondmsarcoma; M«i » - «™ 
Ewbtfs earcoma; Odomogenlc tumor, mabgnanl; Amelias.* o^ 1 "™^ 
AmeLiastoma. malignant; Ama^abo nbmsarcorna; Pbtealoma. maltgnan IM« 

* me ™ " „„.. ldtlIvM NOS; Protoplaamic astrocytoma; 

Glioma. malignantEpondymoma. NOS. Astrocytoma, NUO ' 

, p^aabPcylomaiAsb^m^^ 

Otiaodendroblastoma; Primitive neuroectodennal; CerebeBar sarcoma. NOS: 

Jr. Meningioma, n*ton*nb Neurosarcoma; ^""""^T^^ 
woo, maligna,* Malignant lymphoma. NOS; HodgkWsd.aease.NOS.Hodglone. 

5 ^rina.NO S ;«i^.V"^.^»'»^^^ m, ^ a '^ 
Mgftft lymphomas; M^naa, Mu«ple myotome; Meat ce» »»n» 

Imaoa. M W— * L»m*os»oma ce. touKem*; 
le*emla; Megabatvoblasbc leuKemto; Myeloid sarcoma; and Hal, oa. leubem*. 

,„ ma genes n»y ba ^vad m c*er rbeeaees. such aa M no, «m«ed to diaaaaea 

associated vritb aging or rtetinxtegenetativa diseases. 

AsaodaUon i„ thi. 0C** meana matme nudeobde or proh* "^ m J*>™ 
ZZL, expresaad, acbvated. meclhrafcd o, anered in oamiaomaa aa compared to 
rSTt oobined b-ow. OA aagaence. b*de moaa *=, . op-reg^ 
caressed at a higher level), aa well as those that are down-regulated (,.e. expressed ata 
Z^, to cLomaa. C*ee,uenceea to ««* sequences whtoh have been a^ed 
r,^sa^r^ora^e^^aobs^de^r» W ^s.^d, n g 

« OA aagoe^ea a. bom b^; - - * 

b» those h » art. OA sequences ton orhe, organisms may be mm antn* mo«» tf 
l^^o^av.^mua.o^OAseooenoesa^p^.tamv^^ 

b^ng otomm*. coding crfeh* («, mJoe. hamstere. gu^ea «a. 

(including sheep. ,oa». P*. oows. hoyaes. etc). In some oaaea. prokaryobo CA 

techniques outlined below. 
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CA fences can in<*de both nueleie acid and amino acM sequences In • P<*™V 
tJl^^CAa^uancesa^rac^lnan.™^^. By* term recomfcnan. 

„„cieie acid" herein is meant nuelefc acid. originally fomred in »«ro. In general by the 
^pofction a. nu*e ac* by polymerases and endonu*ases. » a forn, 
|„ aafure. Thus an ieoiafed nucleic acW. in a ftnea, torn, or an axeman veaer ton« In 
It «M DMA metecutea that are not normally iomed. are to* eons*.,* rec— 

rjp^o4 -«* ™e™»° » » «— * «* °" M 3 reCOmWnan ' T T, a 
ma* and refndoduoed m a hoa. «• o, orgenJsnt. « - m— no.^coynMranrty. 

US MWInvt»c*»mac^«».Who««--^^^«^"^^ 

Zl. such nue* adds, ortce prcrfuoed recomdnandy. almough 

Sin**, a -recant*"* proMT I. . prt**- maOa as* rcaombinant teehniqu*. la 

mot* is dietinguisired ton naturally occumng pmtein by at teas. oneormare 
£L«» « ettampte. ■. prdein may to isolaW or puh«ed away .ran aomn or a. C 

„ « substance pare. For exampfc. an iso*- pr«ain is uneacemp^ by a, *asf 

lain in a given sample. A subsbtaMy pare prottln oompriaaa a. teas. about 75% by 
IS * th La, pro*. - a. tat about 80% bam, preferred, and a, leaa, abc*»% 

Z pamoaW i— • *• — " " " "TdeTa 

.. o^lmal^o^oahoato* 

*„igcan»y hW-r ccmceneato man is natn^ seen, though fh. us. of an m^bfe 
eLoterorhi 9 h«p^pronx»tar.sachmatmepn^ismedaa,mc re a^ 

Z* are addflon of an epdope bag or ammo acid subatitudons. unions and demons, as 
30 discussed below. 

^.p^mabodm^maCArra^a^nacWoaaiea. As « to j* 
those in the art and ia more hilly outlined below, CA sequences are useful m a vanety of 
Including diagnostic app.ica.ans. which « deW e^ oacumng 

35 t^Tas we. a. acree*, * — * ***** "TTT 

m the CA sequences can be genanded. In ate facades, sense. then, by -no*c aad or 
-o.^ucleo«de-or g rarnma«calem*a te ™ahe,e« means a. leaa. two mrchxtbdea 
1^1 Bntced together. A nuclei acid of me present inuendon will genendly contain 

Jfc -cagh ,n some cases, as ou,ned below „or euamp* in anffcense 
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applications or when a candidate agent is a nucleic acid), nud ic ack, anaiogs may be used 
that have alternate backbones, comprising, for example, phosphoramidate (Beaucage : e a... 
Tetrahedron 49(10):1925 (1993) and references therein; Letsinger, J. Org. Chem. 35:3800 
(1970); Sprinzl et al.. Eur. J. Biochem. 81:579 (1977); Letsinger et al.. NucL Acds Res. 
14-3487 (1986); Sawai et al. Chem. Lett 805 (1984). Letsinger et al.. J. Am. Chem. Soc. 
110 4470 (1988); and Panels et al.. Chemica Scripta 26.141 91986)). phosphorothioate 
(Mag et al.. Nucleic Acids Res. 19:1437 (1991); and U.S. Patent No. 5.644.048). 
phosphorodithioate (Briu etal., J. Am. Chem. Soc. 111:2321 (1989), O- 
^thylphophoroamidte linkages (see Eckstein. Oligonucleotides and Analogues: A Practical 
, Approach. Oxford University Press), and peptide nucleic acid backbones and linkages (see 
Eghotin. J. Am. Chem. Soc. 114:1895 (1992); Meier eta... Chem. Int Ed. Eng.. 31: 008 
(1992)- Nie.sen. Nature. 365:566 (1993); Carisson et a... Nature 380:207 (1996). a., of wh,ch 
are incorporated by reference). Other analog nucleic acids include those with posrtive 
backbones (Denpcy et al.. Proc. Natl. Acad. Sci. USA 92:6097 (1995); non-ionic backbones 
5 "nt Nos^.386.023. 5.637.684. 5.602.240. 5.216,41 and 4.469.863; Kedrowsh. et 
al Angew. Chem. Intl. Ed. English 30:423 (1991); Letsinger et al.. J. Am. Chem. Soc. 
110 -4470 (1988); Letsinger et a... Nucleoside A Nucleotide 13:1597 (1994); Chapters 2 and 3. 
ASC Symposium Series 580. "Carbohydrate Modifications in Antisense Research . Ed. Y.S. 
Sanghui and P. Dan Cook; Mesmaeker et al.. Bioorganic & Medicinal Chem. Lett 4*95 
, 0 (J* Jeffs et a... J. Biomo«ecu.ar NMR 34:17 (1994); Tetrahedron Lett 37:743 (1*6» and 
non-ribose backbones, induding those described in U.S. Patent Nos. 5.235.033 ^and 
5.034.506. and Chapters 6 and 7. ASC Symposium Series 580. "Carbohydrate ^cations 
in Antisense Research". Ed. Y.S. Sanghui and P. Dan Cook. Nude, adds 
more carbocydic sugars are a.so induded wtthin one definMon of nude,c acds (see Jenk,ns 
25 et a. Chem. Soc. Rev. (1995) PP 169-176). Severs, nudeic add anaiogs are descnbed in 
Rawls C&E News June 2. 1997 page 35. A., of these references are hereby expressly 
incorporated by reference. These modifications of the ribose-phosphate backbone may be 
done for a variety of reasons, for exampie to increase the stability and ha Wife of such 
mo.ecu.es in physiologies, environments for use in anti-sense appHcations or as probes on a 

30 biochip. 

As wH. be appredated by those in the art, all of these nudeic add anatogs may find use in the 
present invention, .n addition, mixtures of natural* occurring nucleic adds and analogs can 
be made; alternatively, mixtures of drfferent nucleic acid analogs, and mixtures of natura..y 
35 occurring nucleic acids and analogs may be made. 

The nucleic acids may be single stranded or double stranded, as specified, or contain 
portions of both double stranded or single stranded sequence. As will be appreciated by 
hose in the art. the depiction of a single strand "Watson" a.so defines the sequence of the 
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^nd V»dC thus the sequences described h re* also indudes the eompttnen. of 
TjTJt 1* acTmay be DNA. both gen-* and cDNA, RNA o, a hybrid. 
!£ZZ£ add eontdns an, comdnafion o, and rtbo*t*o,«es. and any 

bases, inddding utadi. adedne, thymine, cynosin.. guanine, tnosue. xanthtne 

d nlo.de and nd^eobde an*,, and mod*ed 

ZL nodeosidea ,n add»on. * -"«• 

MM. Thus nx tamp* ■• mdbAto. a* of a pe^M. nodac ac* radr coning 

base, ate reletted to herein as a nudeodde. 

' An CAsequenee can be iniM, identtned b, aebstantial nadaic add and/or amino add 

Z^^^CAeeddencea^heteln. «~££L,, 
2. M ov«a. nudde add or amino add eequenee. and is t^nerelly ddemuned as odfined 
below, using either homdogy programs or hybridization ooodBona 

5 n» CA sequences of the invention were initJaHy identified as described herein; basically. 

TI»CA.«M~oud«l^co"^'*-^^ to *^ In general, the 
^TZL gene and adiyabng » (e g. P-on«*r insertion); secondly, by aancabng a 

Z L* robo*os enhancers. ,n*ddg St». are Known » ad on genes op » 
25 approximately 200 Wlobasesdthe insertion site. 

,„ . prow enfcodfiment OA sequel aro Orose tna, aro ap«gda« in 

£ Z. dfi>ron«on a-ga •*»■*•-> as »ed ta* mean. a, tead ab^60%. 
2 ^1, a. .as, abou, ,00%. roaro prefereb* a, .as, abod 
a, M abod 200%. w«h from 300 fin a, iead 1000% being aspect preferred. 

In a preferred embodiment CA sequences are those that are down-regulated in »M« 
M "me exprosdon of fh.se genes is lower in oaroinoma bssue as compared to nonnd , 
STZ. L ddfieronWdr, daga. -DowrMe^" * — 

m more preferedy a, M abod 100%. moro preferedy a, teas, abod 150%. more 

In a prefened embodiment. CA sequences at those Ota, ate altered tni, show either the 
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expression profile or an altered profile as compared to normal *mphoid tissue of the same 
differentiation stage. "Altered CA sequences" as used herein refers to sequences which are 
truncated, contain insertions or contain point mutations. 

CA proteins of the present invention may be classified as secreted proteins, transmembrane 
proteins or intracellular proteins. 

,„ a preferred embodiment the CA protein is an intracellu.ar protein. Intracellular proteins 
may be found in the cytoplasm and/or in the nucleus, .ntracellular proteins are involved in all 
aspects of cellular function and replication (including, for example, signaling pathways); 
aberrant expression of such proteins results in unregulated or disregulated cellular processes. 
For example, many intracellular proteins have enzymatic activity such as protein kinase 
activity protein phosphatase activity, protease activity, nucleotide cyclase actmty. 
polymerase activity and the like. Intracellular proteins also serve as docking proteins that are 
involved in organizing complexes of proteins, or targeting proteins to various subcellular 
localizations, and are involved in maintaining the structural integrity of organelles. 



20 
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An increasingly appreciated concept in characterizing intracellular proteins is the presence in 
the proteinsofoneormoremotifsforwhichdefined functions have been attnbuted. In 
addition to the highly conserved sequences found in the enzymatic domain of proteins, hrghfy 
conserved sequences have been identified in proteins that are involved in protein-proton 
interaction. Forexampte. Src-homo.ogy-2 (SH2) domains bind tyrosine-phosphorylated 
targets in a sequence dependent manner. PTB domains, which are distinct from SH2 
domains, also bind tyrosine phosphorated targets. SH3 domains bind to praline-nch targets. 
,n addition. PH domains, tetratricopeptide repeats and WD domains to name only a few nave 
been shown to mediate protein-protein interactions. Some of these may also be involved ,n 
binding to phospholipids or other second messengers. As will be appreciated by one of 
ordinary skill in the art. these motifs can be identified on the basis of primary sequence; thus 
an analysis of the sequence of proteins may provide insight into both the enzymatic potential 
of the molecule and/or molecules with which the protein may associate. 

, n a preferred embodiment, the CA sequences are transmembrane proteins. Transmembrane 
proteins are molecules that span the phospholipid bilayer of a cell. They may have an 
intracellular domain, an extracellular domain, or both. The intracellular domains of su«* 
proteins may have a number of functions including those already described for .ntracellular 
proteins. For example, the intracellular domain may have enzymatic activity and/or may . 
serve as a binding site for additional proteins. Frequentty the intracellular doma-n of 
transmembrane proteins serves both roles. For example certain receptor tyrosine kinases 
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have both protein kinase activity and SH2 domains. In addition, autophosphorylation of 
tyrosines on the receptor molecule itself, creates binding sites for additional SH2 domam 
containing proteins. 

Transmembrane proteins may contain from one to many transmembrane domains. For 
example, receptor tyrosine kinases, certain cytokine receptors, receptor guanylyl cyclases 
and receptor serineAhreonine protein kinases contain a single transmembrane doma.n. 
However, various other proteins including channels and adenyly. cyclases conta,n numerous 
transmembrane domains. Many important cel. surface receptors are classified as seven 
, transmembrane domain" proteins, as they cental 7 membrane spanning reg-ons. .m porta* 
transmembrane protein receptors include, but are not limited to insulin receptor, ,ns ^e 
growth factor receptor, human growth hormone receptor. 

Lptor. epidermal growth factor receptor, tow dens«y lipoprotein receptor, eprderma. growth 
factor receptor, leptin receptor, inteneukin receptors. e.g. 1L_1 receptor. .L_2 receptor, etc. 

5 Characteristics of transmembrane domains include approximately 20 consecutive 

hydrophobic amino acids that may be followed by charged amino acids. Therefore, upon 
analysis of the amino acid sequence of a particular protein, the localization and number of 
transmembrane domains within the protein may be predicted. 

50 The extracellufar domains of transmembrane proteins are diverse; however, conserved motifs 
are found repeatedly among various extracellular domains. Conserved structure and/or 
functions have been ascribed to different extracellular motifs. Forexample. cytokine 
receptors are charactered by a cluster of cysteines and a WSXWS (W= tryptophan. S= 

25 serine X=any amino acid) motif, lmmunog.obu.in-.ike domains are highly conserved. Mucin. 
Tdomains may be invoived in cel. adhesion and leucine-rich repeats participate ,n protean- 
protein interactions. 

Man, a***!* domains are mvohad la bmama to o»w motecutea. In ana aspaA 
«* ^ ta MM prc^s. a, ama. mo*ou*a »*• *> 

35 Z «o <*—*«< moteoutes. >n «a -aapact m — — " 

LodrtdlB— •=»■»«— ~< U.O-acaUtaaxaa^viaa^ylphospnaM,^ 
Z, an J or ma, memaarvaa b. aaosmambraoa profcins. Etfrmfcte domans also 
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CA proteins that ar transmembrane are particular* preferred in the present invention as they 
are good targets for immunotherapeutics. as are described herein. In addition, as outlmed 
below, transmembrane proteins can be also useful in imaging modaht.es. 

It will also be appreciated by those in the art that a transmembrane protein can be made 
solude by removing transmembrane sequences, for example through recombinant methods. 
Furthermore, transmembrane proteins that have been made soluble can be made to be 
secreted through recombinant means by adding an appropriate signal sequence. 

in a preferred embodiment, the CA proteins are secreted proteins; the secretion of which can 
be either constitutive or regulated. These proteins have a signal peptide or signal sequence 
that targets the molecule to the secretory pathway. Secreted proteins are involved .n 
numerous physiological events; by virtue of their circulating nature, they serve to transmrt 
signals to various other cel. types. The secreted protein may function in an autocrine manner 
, (acting on the cel. that secreted the factor), a paracrine manner (acting on cells in close 
proximity to the cell that secreted the factor) or an endocrine manner (acting on cells at a 
distance) Thus secreted molecules find use in modulating or altering numerous aspects of 
physiology. CA proteins that are secreted proteins are particularly preferred in the present 
invention as they serve as good targets for diagnostic markers, for example for blood tests. 

° An CA sequence is initially identified by substantia, nucleic acid and/or amino add sequence 
homology to the CA sequences outiined herein. Such homology can be based upon the 
overa.. nudeic acid or amino acid sequence, and is generally determined as outhned below, 
using either homology programs or hybridization conditions. 

'' 5 as used herein, a nucleic acid is a "CA nudeic acid" if the overall homology of the nudeic 
add sequence to one of the nudeic acids ofTab.es 1-112 is preferably greater*™ about 
75%. more preferably greater than about 80%. even more preferably greater tnan about 85% 
and most preferady greater than 90%. In some embodiments the homdogy w... be as h,gh 

3 o as about 93 to 95 or 98%. In a preferred embodiment, the sequences which are used to 
determine sequence identity or simiiarity are selected fiom those of the nudeic adds of 
Tables 1-112. .n another embodiment, the sequences are naturally occurnng allehc vanants 
ofthesequencesofthenudeicacidsofTab.es 1-112. In another embodiment, the 
sequences are sequence variants as further described herein. 

" Homology in this context means sequence similarity or identity, with identity being preferred. 
A preferred comparison for homology purposes is to compare the sequence 
sequencing errors to the coned sequence. This homology will be determ,ned us,ng standard 
techniques known in the art. including, but not .imited to. the .oca. homdogy algonthm of 
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Smith & Waterman, Adv. Appl. Math. 2:462 (1981). by th homology alignment algorithm of 
Needleman & Wunsch. J. MoL Biol. 48:443 (1970). by the search for similarity method of 
Pearson & Lipman. PNAS USA 85:2444 (1988). by computerized implementations of these 
algorithms (GAP. BESTFIT, FASTA and TFASTA in the W,sconsin Genetics Software 
Package Genetics Computer Group. 575 Science Drive. Madison. Wl). the Best Fit sequence 
program described by Devereux et a... Nucl. Acid Res. 12:387-395 (1984). preferably using 
the default settings, or by inspection. 

One example of a useful algorithm is PILEUP. PILEUP creates a multiple sequence 
, alignment from a group of related sequences using progressive, pairwise alignrnents It can 
also plot a tree showing the clustering relationships used to create the alignment PILEUP 
uses a simplification of the progressive alignment method of Feng & Doolittle. J. MoL Evol. 
35-351-360 (1987); the method is similar to that described by Higgins & Sharp CABIOS 
5:151-153 (1989). Useful PILEUP parameters including a default gap weight of 3.00. a 
5 default gap length weight of 0.10. and weighted end gaps. 

Another example of a useful algorithm is the BLAST algorithm, described in Altschul et at. J. 
MO. Biol. 215. 40W10. (1990) and Karlin etal.. PNAS USA 90:5873-5787 (1993). A 
particularly useful BLAST program is the WU-BLAST-2 program which was obtained from 
Altschul et a... Methods in Enzymology. 266: 46(M80 (1996); http://WasLwustfl. WU-BLAST- 
, 0 2 uses several search parameters, most of which are set to the default values. The 

adjustable parameters are set with the following values: overlap span 1. overlap fracbon = 
0125 word threshold (T) = 11. The HSP S and HSP S2 parameters are dynamic values and 
are established by the program itself depending upon the composition of the particular 
sequence and composition of the particular database against which the sequence of .nterest 
25 is being searched; however, the values may be adjusted to increase sensitivity. A % ammo 
acid sequence identity value is determined by the number of matching identical rescues 
divided by the total number of residues of the "longer- sequence in the aligned region. The 
longer sequence is the one having the most actual residues in the aligned region (gaps 
introduced by WU-Blast-2 to maximize the alignment score are ignored). 

3 ° Thus "percent <%) nucleic acid sequence identity" is defined as the percentage of nucleotide 
residues in a candidate sequence that are iden«ca. with the nucleotide residues of the nude* 
acids ofTab.es 1-112. A preferred method utilizes the BLASTN module of WU-BLAST-2 set 
to the default parameters, with overlap span and overlap fraction set to 1 and 0.125. 

35 respectively. 

The alignment may include the introduction of gaps in the sequences to be aligned. In 
addition, for sequences which contain efther more or fewer nucleotides than those of the 
nucleic acids of Tables 1-1 12. it is understood that the percentage of homology w,ll be 
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Oararmlnnd Pasnd on tha number of ho^oua * ^ " 

nudnoaidaa Tnos. for axampla. homology fseounnoeaahortarthan ihosaof.h. aaqunnoaa 

and aa olsoosaed M - be deremdned u*n, M number o, nudeosrtee 

in the shorter sequence. 

TP*, for examp*. M «** «"** ^ — "* • ** ™** 

attnoenev ooodiHooa a* known id M art aee lor example Maniaba at al.. Moleoutor 

, ™ Manual. 2d Mta. 1M - *™ ^ * 

MMM*. M* incorporated by refemnee. Sblngenl condrbons « 

^Lependeo. and - * d«aran, * dine** cb»d«anc«. 

«W. JTb found h Tiiaeen. Technlpues In Bloohemleby and Mcteou* B <*>g»- 
, ^^»*Nuotalo*o M f^,~o,^o,d y ddd^a^ 

£tlwc Mr l» .normal mailing polo. (Tm) tor M .per* eequance « a dtfned 

^iraraoooZdat^dum). ^«r*-- «••"-••«** 

eeooo^n „ o*er « « pH 7.0 «. 8.3 and to. temped . « leas. aK* » C*» 
L probes <e.g. ,0 » 50 nueteobdea, and .« teastadout 60-0 to lung P«*» to 
25 ^nuojoddea,. m*****—**"*""*"*** 

destabilizing agents such as formamide. 

,„ anorna, am^m«rt e»*g«n. PyBte^adon oor^na an, used; 

and Ausubel, supra, and Tijssen, supra. 



30 



35 



maddMon d»CAnoo>eteacldse,uencesmlh.im^a re f^^ 

^o.onoo^napos^.loraxampla.maCA.a^m.ybdan enh*n*r M 
^.prcxooncogena. In o« oor^ lr«^ «^ re^. norn^ 

Sri-.^ood^andnon^ng^n,. ^"^T^ 
*,hose in the art using « seduenees pmv«ed herein, adddfcnal 

aZL o, ma m mngrn sx*uences; ana mm. a. al.. and M a. a,.. eupn.. M» 
ZZ-P— * ~ " te ~ PC "' ^ 



14 



PCMJS01/51291 

WO 03/008583 
PCR. 

Once toe CA nodeic add is ttntted. It can be doned and. « necessanr. «a confer, pads 
^ to form the entire CA nucleic add. Once ,so*ted Pom m nalura, 
lairred w«r a p*m» o. otoer vedor or exceed toera*m as a «near nude* add 
Zmt toe recombinant CA nude* acid oan be MM used as a probe to *nb* and 

^amor nude* ac» to nt*e ntodKed o, ,adam CA nude* adds and p^rns. 
Tn.rAn«rle*actoofthepresentinventtonareused in several urays. InaPrst 

Itu^t ^elg J* dfcgnoe.* n*toode. as c^ be*w. ., tea admln^on. fo, 
rZ^a^lnbsaneeapp^. 

5 expression of CA proteins, again either for screening purpu*« 
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patient 



to a pua^nad entoe*** nuuleic add probes to CA nude, -ft (both £ 

adacned to to. « **- « * — • TTZtoZ 
ZL adda. I... me target stance ,a^r toe large, seance 
U ■ * e*am*e in sand-ch assavs, sad, m 

Uuenca and toe prabe* o. the praeem tovendon cccu-a. As out ned 
JUnantorn, need no. be perto* toera ma» be an, number * basep-r 
rid* rrti we*™ arfto hybndiaafcn beNraen toe terse! sequence and toe angle sbended 

hldization can occur undo, even the lees, sldngen. o. h,bndlza«on condrbons. toe 
s^tnotaoon^en^ta^ee^ence. Thus, b, -subston^ cc^ntorv- 

h,w«e under nonnal reaction oondMone. padiculan, high stongenc, condrbons. as outoned 



herein. 



A nudeic add probe Is genera., single sanded bUcan be paraa* stogie end parPal* 
£TU1 Tne 5 «n.edne«o.toep ro beied^db»toesto^ra.co™^l»n 
SX— ■ in genera,, toenuc^ac^pn^ranoe^uts 
to abT.100 bases long, urrto Pen, about 10 to about 80 bases being prafcrrad. and tronr 

Sid m some embodiment much tonger nude* adds can be used, up to hundreds 
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,n a preferred embodiment, more than one probe per sequence is used, with either 
overlapping probes or probes to different sections of the target being used. That ,s two. 

particular target. The probes can be overlapping (i.e. have some sequence ,n common), or 
separate. 

A. «» to app-noiatnd t* tt»sn In .ria art. nocteio aaids oan Pa adached or intmatrilizad to a 
J^Tfn , «. variety of ways. By W*- and ^ «- 
a— „ «r* b— fha tan* ao» p-obo and 
suffcian. to be MM undar tha cond*oas of binding, washing, analysis, aad ramovaf m 
o^Two. Triabiad^oaabaoo.afaato.aoaK.ovatent e,-aoaoo»a te n.o,ad,a 9 aad 

La.Pl.ot* -riaraodona. fncfefed ia aoo-covafca. binding Is tha eovafea. attend oU 
n^fa so* as. » *• sappari aad M nonunion, binding of fha b»*aylaM 

^.Tsaap^ia. 

M*, sfgma boads. pf bonds and option Pond, Cov*nf bonds - MMd 
^L«Ld»p™i-and^aoidsupporio,canba^a»a=^f^.rb, 

SL» fal^nn^^ofa.^aa.^^o.oova^andaaa.oval.a. 
interactions. 
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,n genera., the probes are attached to the biochip in a wide variety o ways, a w .be 
appreciated by those in the art As described herein, the nucleic acids can etther be 



the biochip 



The bioeMp comprises a suteble solid substrate. By "substrate- or 'solid support- or other 
The Diocnip compi «o . fc . b modified to contain discrete 

arammatical equivalents herein is meant any matenal that can be moameo 
gramm . , 4K/s ^nhmpnt or association of the nucleic acid probes and is 

individual sites appropriate for the attachment or assoc.at.on o 

amenable to at least one detection method. As will be appreciated by those .n the art. the 
^ of possible substrates are very large, and indude. but are not limited to. glass and 
n^ed olctionalfced glass, plastics (including aorylics. po>styrene an of 
styrene and other materials, po*propy.ene, po ,ethy te ne. po^buty ene. po^e hanes. 
Teflon- etc.). polysaccharides, nylon or nitrocellulose, resins. s..,ca or s,..ca_based 

—» and modifled silicon, carbon, metals, inorganic glasses, etc. In 
general, the substrates allow optical detection and do not appreciably fluoresce. 
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to a p,efened embodiment th surface of to. bfcchlp and M probe ma, be denied ««h 
M Ldonal groups .or subseguen. aa.ca.aa. o..he m. Thus, for exampfe. to. 

ZZZ. J - ' — — in *^ " IT*, 

oroups caitoxy poops, oxo groups and toio. groups, wito amino groups bans paihc lady 
ZTZU — graaps. - prabaa aan ba adached ustog — groups 
on,hep<obes. Forexampto, „^ac M scpa«ngaa^g™p,-baa^ad,^ 

surfal camp*** an*» g^rps. » — » — — — ^ 
example t»no«rhetoro4>itonc«onai linkers as OT^^ 

cZ^aog. technic sa^oa aa crcssjnhe*. pagas .55J0O. tocomo,** hereto by 
St i~ to soma caaaa. add«.na, «nhere. such aa aTKyl groups <«*d,n. 
substituted and heteroallryl groups) may be used. 

la ,1* embodiment the ollgoaaoladtidas are syntoeslzed as Is .mown In toe an. and men 
s ^"ar^huaraaybaa.aphad^s^sapp^d.a.ac^a^be^aah 
internal nucleoside. 

nonfat For example. WodnyUded oligonudeoddes oaa be made. «h«* told to 
, 0 surfaces coualently coated with st™pBvldto.n^ldnginadachnw«. 

F „ »_|, photoaottuahoa techolquee uBalag photopotymenzehoa compounds and 
^ am used, tha^ea^^huo^a^c.nhe s^ed. 
25 Tig «e« ">«m photodthogrephlc techn*uea. such as those daaoht*. » WO 

£££ WO g5A5505; U.S. MM I*. 5.700.637 and 5.445.834: and ^ferenoea cited 
ZZ a* expmss* «o^ed by mterence; toese methods ofadachman. 
(ami the basis of the Asymetrix GeneChip technology. 

,„ ,„ addidoa to ma solid-phase taohnctog, rearmed by hiochip amtys. geae «p»s*n ran 
" rhaTualeeustog iirtoid-P^s.^ One sue* system is tone* ^sei*am 

^c Lto acid stances. Thespeci^isdem^syntoedetogoauctoattle 

^toZ of doubte-st-amtod DMA to opposite ortentedons. Temped cytong toe 
tZTrtM orates a continuous cycte of p^er toidi*. dansCptton. and ra^lhn, of 
Zlto acid .= todMdua, sdaads. 7ha mm * an exponent tocxease o, too targe. 
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dsDNA product This product can be quantified in real time either through the use of an 
intercalating dye or a sequence specific probe. SYBR® Gre ne I. is an example of an 
intercalating dye. that preferentially binds to dsDNA resulting in a concomitant increase in the 
fluorescent signal. Sequence specific probes, such as used with TaqMan® technology, 
consist of a fluorochrome and a quenching molecule covalentiy bound to opposite ends of an 
oligonucleotide. The probe is designed to selectively bind the target DNA sequence between 
the two primers. When the DNA strands are synthesized during the PCR reaction, the 
fluorochrome is cleaved from the probe by the exonuctease activity of the polymerase 
resulting in signal dequenching. The probe signaling method can be more specific than the 
intercalating dye method, but in each case, signal strength is proportional to the dsDNA 
product produced. Each type of quantification method can be used in multi-well liquid phase 
arrays with each well representing primers and/or probes specific to nucleic acid sequences 
of interest When used with messenger RNA preparations of tissues or cell fines, and an 
array of probe/primer reactions can simultaneously quantify the expression of multiple gene 
, products of interest See Germer. S.. et al.. Genome Res. 10:258-266 (2000); Heid. C. A. et 
al.. Genome Res. 6. 986-994 (1996). 

In a preferred embodiment. CA nucleic acids encoding CA proteins are used to make a 
variety of expression vectors to express CA proteins which can then be used in screening 

o assays, as described below. The expression vectors may be either self-replicating 

extrachromosomal vectors or vectors which integrate into a host genome. Generally, these 
expression vectors include transcriptional and translational regulatory nucleic acid operably 
linked to the nucleic acid encoding the CA protein. The term "control sequences" refers to 
DNA sequences necessary for the expression of an operably linked coding sequence in a 

5 particular host organism. The control sequences that are suitable for prokaryotes. for 

example, include a promoter, optionally an operator sequence, and a ribosome b.nd.ng site. 
Eukaryotic cells are known to utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid fe "operably linked" when it is placed into a functional relationship with another 
J0 nucleic acid sequence. Forexample. DNA for a presequence or secretory leader is operably 
linked to DNA for a polypeptide if it is expressed as a pieprotein that participates .n the 
secretion of the polypeptide; a promoter or enhancer is operably linked to a coding sequence 
if it affects the transcription of the sequence; or a ribosome binding site is operably linked to a 
coding sequence if it is positioned so as to facilitate translation. Generally, "operably ttnked" 
35 means that the DNA sequences being linked are contiguous, and. in the case of a secretory 
leader contiguous and in reading phase. However, enhancers do not have to be contiguous. 
Linking is accomplished by ligation at convenient restriction sites. If such sites do not exist, 
synthetic oligonucleotide adaptors or linkers are used in accordance with conventional 
practice The transcriptional and translational regulatory nucleic acid will generally be 
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app^ »m m «« used » « <* »-* to — * r^T"""* 

the CA protein m Mt Numerous typ« of appropriate exp-esston vectors, am. eufcUa 
regulator sequences are know in the art tor a variety of host cans. 

5 fn gen«al. the fanscrlpfiooa, and translationa, reouutory seouences ma, MM. but am not 
^ to .p ro m«erse<^^f«in 9 s^.«r^pW^^^ 
^uaoces. wns^ona, sort and step aerpmnces. and enhancer c acwaor saqc*^ fn 
a plfer-ed emhodlmrt th. »e<l«nc.s include a promoter and tmnsonphrml 

10 start and stop sequences. 

Promoter sequences encode either constitutive or inducible promoters. The promoters may 
be either natural* occurring promoters or hybrid promoters. Hybrid promoters. which 
combine e.ements of more than one promote, are also known in the art and are useful . the 
15 present invention. 

m addition, the expmssion »ctor may comprise additional rfcments. For example. Be 
exF .. = a ~aii* far ^nrefision and in a procaryotic hos 
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expression vector may have two replication - 

ZLs for example in mammalian or insect cells for expression and in a procaryotic host 

vector contains at least one sequence homologous to the 

Aaohomaogous sequences which «anK me expression construct 

be directed to a specific locus in the host cel. by selecting the appropnate homologous 

teqtnl for incTon in me vector. Constructs for integrating vectors are well Known ,n tine 



25 art 



, .^ imQ nt th* expression vector contains a selectable marker 
in addition in a preferred embodiment, the expression vwau 

^aL^se^nm.an^medhosto.na Sefechonpermaamwafmownin.ha 

art and will vary with the host cell used. 

- » P-en, awenKx, am Pruned h, eofhrrfn, a host ca. fmastemed 

app^oc^aona^^oroauaee^onof^CAp^. 

for CA protein expression w.1 »ar» «» •. choioa of the expert «c« and 
the hostc* and wilt he ear* ascerfaMd b» one stalled in the an fhmu* rouline 
e^enmanttUon. Formmmp te .»»useo.c»ns*f»epmmomm^aexp re ^»^» 
r ^ re ^n^th. g mw*andpm.^no.^hoa,^.wh»e te u«o,an^o«e 

plater reou.es Pre appropnate gnwth conditions lor nrduaion. In addthon. n «ma 
L m msec, oal, expression am «k -uses, and thus ha^as. hma sefcchon can he cnucm, 
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for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria. fungi, and insect plant and 
animal cells. Including mammalian cells. Of particular interest are Dmsophila me/anogasfer 
cells. Saccharomyoes cerevisiae and other yeasts. E coff. Badllus subtilis. Sf9 ce.ls. C1 29 
cells. 293 cells. Neurospora, BHK, CHO. COS. HeLa cells. THP1 cell line (a macrophage cel. 
line) and human cells and cell lines. 

in a preferred embodiment, the CA proteins are expressed in mammalian cells. Mammalian 
expression systems are also known in the art and include retroviral systems. A preferred 
expression vector system is a retroviral vector system such as is generally descnbed ,n 
PCT/US97/01019 and PCT/US97/01048. both of which are hereby expressly incorporated by 
reference Of particular use as mammalian promoters are the promoters from mammals 
viral genes, since the viral genes are often highly expressed and have a broad host range. 

s Examples include the SV40 early promoter, mouse mammary tumor virus LTR promoter, 
adenovirus major late promoter, herpes simplex virus promoter, and the CMV promoter. 
Typically transcription termination and polyadenylation sequences recognized by mammals 
cells are regulatory regions located 3' to the translation stop codon and thus, together with the 
promoter elements, flank the coding sequence. Examples of transcription terminator and 

o polyadenlytion signals include those derived form SV40. 

« 

The methods of introducing exogenous nucleic acid into mammalian hosts, as well as other 
hosts, is well known in the art. and will vary with the host cel. used. Techniques include 
dextran-mediated transection, calcium phosphate precipitation, polybrene medated 
25 transfection. protoplast fusion. electroporaBon. viral infection, encapsulation of the 
polynucleotide(s) in liposomes, and direct microinjection of the DNA into nude.. 

in a preferred embodiment. CA proteins are expressed in bacteria, systems. Bacterial 
expression systems are well known in the art Promoters from bacteriophage may also be 
30 used and are known in the art In add-on. synthetic promoters and hybrid promoters are also 
useful; for example, the tac promoter is a hybrid of the trp and tec promoter sequences 
Furthermore, a bacteria, promoter can include naturally occurring promoters of non-bactena. 
origin that have the ability to bind bacteria. RNA polymerase and initiate transition. In 
addition to a functioning promoter sequence, an efficient ribosome binding srte is des.rab.e. 
35 The expression vector may also include a signal peptide sequence that provides for secret 
of the CA protein in bacteria. The protein is either secreted into the growth media (gram- 
positive bacteria) or into the periplasmic space, located between the inner and outer 
membrane of the cell (gram-negative bacteria). The bacteria, expression vector may a.so 
include a se.ectab.e marker gene to aBow for th selection of bacteria, strains that have been 
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transformed. Suitab.e selection genes indud genes which render the bacteria resistant to 
drugs such as ampicillin. chloramphenicol, erythromycin, kanamycin. neomycin and 
tetracydine. Selectable markers also include biosynthetic genes, such as those ,n the 
histidine. tryptophan and .eudne biosynthe«c pathway, These components are a^ed 
into expression vectors. Expression vedors for baderia are we., known ,n the art. and .elude 
vedors for Bacillus subtilis. E. coll. Streptococcus cremoris. and Streptococcus Wans 
among others. The baderia. expression vedors are transformed into baderia. host cells 
using techniques well known in the art. such as calcium chloride treatment, eledroporafon. 
and others. 

,n one embodiment CA proteins are produced in insed ce.ls. Expression vedors for the 
transformation of insed cells, and in particular, baculovirus-based express,on vectors, are 
well known in the art 

, ln a purred embodiment CAprotein is produced in yeastcells. Yeast expression systems 
are we., known in the art and indude expression vedors for Saccnammyces «™~. 
Candida albicans and C. ma/tosa. Hansenuta polymorphs. Kiuyveromyces frag** and ic 
teft, Picnia g U ///en/no n d//and P. pastor*. Schizosaccnaromyces pombe. and Vam>w,a 
lipolytics. 

" The CA pntfoin may also f made » " — «— «• ^ "? T" L" ^ 

« for ex^e. „ die «a*n o, n^noe** ■ the <-*~" — 

protein may be made as a fusion protein to increase expression, or for other reasone For 
„ example. In M CA pro*, is an CA peptide, the nude* acid encodtng me pepude may 
be United to other nucleic acid for expression purposes. 

,„ one embodiment, the CA nucleic acids, proteins and arotbod* 0. ma Imronfcn « 
labeled. By Tabetorr heroin » mean, that a compound has atlaas. one etoment, t^roor 

ma, be .rcorporoted Into lire CAnudeic aolds. proton and antibody a. any poabon For 

35 sigMl .ThadaleoteW.moiatyma y baanKlioisoWe.suohaaH. C P. 

.uorescen, or chernlfcminescen, compound, such as «uoro**n isodt^e. modamme 
HTdn or an enc,me. such a, *»»r» phosphtnase. bota^osldaae or hocseta** 

An, metlod hnown In the an for c« -he andbod, 
emplced. including those methods rtoscnbed b, Hunter e. at.. Natuta. 144.945 (1962). 
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03,* eta,.. Otochemisd,. 13:1014 .1974); Pain a,*. X Immunol Med... «1. («*- 
Nygren. J. Histoehem. and Cytodrem. 30:40? (1982). 

Accordingly toe present Irw^on also pre«fesC> protein nuances. An CA prole* of toe 

S«, » MM m — ™ » 
po*pep«des. and peptides. As win Pa oppressed b, toose ,n toe * * -~ - 
Lquancaa af to. invention can be u*d to ganaral. P-olem s«,uen<*s. There are «*» 
TXa . do m. «uding to. en*, gene and ved*ng » frame ~ — • - 
so, „L. or by eompadng . to Known sequ**es to seareh to, M > »-* ' 
assuming toe CA protain has homology to soma protein in toe database be,ngu*d. 
ZZ « nuL a* sagaanoas a. inpa, » a P**«m toatw. «* , * M 
Lea » homotogy. 1W is do*. to a preferred ambodimant using toe fo«ow,ng NO. 
Advanced BLAST parameters. The program is blastx or biastn. The datobase « nr. The 
toutdato fe as "Sequence in FASTA former. Tha onanism M. W. The "expect-* 
, Ttoet^d^ ^-d^-foSC,..---^^^^- 
vfeWispairwise. The "query Gene«o Codes" is stondarad). *^""™ 9 * 
existence cost is 11. par residue gap cost is 1; and the lambda ratio is .85 default This 
results In toe general*, of a putative protein sequence. 

. Also todudadwitoin one embodto»n. of CAprefeins are ^no^ variants « die neh«.r» 
tlfegsequenees. as dotted hereto. ^•^^Z'THZ 
^ahlwhomofogoostotoewc.^ypese^e. ~^*~ t T£ 
80% even more preferably great., than about 85% and most preferably greafertoan go* 
"tZZLL. ere homofogy - be as h*h as ebou, 03 to 05 « 98* As fornudato 

!5 adds homology in due nontext means sequenee slmiWy or Wenffly. «dh identoy M. 

^.rh^w^b.detorminodus^etondaretochn.oashnownintoeadas 

are outlined above for die nucleic acid homologies. 

CA proteins of the present tow*. maybeshoderorlongerthenthewildtypeernino«Sd 
« H The. to a preferred embodiment M uded ».»h,n toe dedndion o, CA p»*ns 

« ctlo ad of toe invent may be used to obtoto add***, eo*e -e,«i, end 
,hus additional protain sequence, using techniques known in the ere 

35 ,„ a preferred embodiment toe CA protoins are derivedve or variant CA protoins as oon^ed 
" ^rltypesequanoe. Thal B .a,ou«nedn W efo,,betow.toededv3 a ve^pep M e- 
contain a. leas, one amino acid substitution, defetion or toserdon. wid, ammo ecd 

^ult^s b«n 9 prttftt P— ■ ™ — "» —** 
may occur at any residue within the CA peptide. 
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Also included in an embodiment of CA proteins of the present invention are amino acid 
sequence variants. These variants fall into one or more of three classes: subst.tut.onal. 
insertions or deletional variants. These variants ordinarily are prepared by site specific 
mutagenesis of nucleotides in the DN A encoding the C A protein, using cassette or PCR 
mutagenesis or other techniques well known in the art, to produce DNA encoding the vanant. 
and thereafter expressing the DNA in recombinant cell culture as outlined above. However, 
variant CA protein fragments having up to about 100-150 residues may be prepared by in 
vitro synthesis using established techniques. Amino acid sequence variants are 
characterized by the predetermined nature of the variation, a feature that sets them apart 
from naturally occurring allelic or interspecies variation of the CA protein amino acd 
sequence. The variants typically exhibit the same qualitative biological activity as the 
naturally occurring analogue, although variants can also be selected which have modrfied 
characteristics as will be more fully outlined below. 

While the site or region for introducing an amino acid sequence variation is predetermined, 
the mutation per se need not be predetermined. For example, in order to optimize the 
performance of a mutation at a given site, random mutagenesis may be conducted at the 
target codon or region and the expressed CA variants screened for the optimal combinaton of 
desired activity. Techniques for making substitution mutations at predetermined sites in DNA 
having a known sequence are well known, for example. M13 primer mutagenesis and LAR 
mutagenesis. Screening of the mutants is done using assays of CA protein a<*v.t.es. 

Amino acid substitutions are typically of single residues; insertions usually will be on the order 
of from about 1 to 20 amino acids, although considerably larger insertions may be tolerated. 
Deletions range from about 1 to about 20 residues, although in some cases deletions may be 
much larger. 
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30 



35 



Substitutions, deletions, insertions or any combination thereof may be used to arrive at a final 
derivative Generally these changes are done on a few amino acids to minimfce the alteration 
of the molecule. However, larger changes may be tolerated in certain circumstances. When 
small alterations in the characteristics of the CA protein are desired, substitutions are 
generally made in accordance with the following chart: 



Original Residue 



Chart I 

Exemplary Substitutions 
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Ala 

Arg 

Asn 

Asp 

Cys 

Gin 

Glu 

Gly 

His 

tie 

Leu 

Lys 

Met 

Phe 

Ser 

Thr 

Trp 

Tyr 

Val 
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Ser 
Lys 

Gin, His 

Glu 

Ser 

Asn 

Asp 

Pro 

Asn, Gin 
Leu, Val 
He, Val 
Arg, Gin, Glu 
Leu, He 
Met. Leu, Tyr 
Thr 
Ser 
Tyr 

Trp. Phe 
lie, Leu 



Substantia, changes in function or immunological Kent* are made by se.«*ng subs^ons 
Laretessconsen/ativethanthoseshowninChartl. For examp»e, substM.ons may be 

tne altera**, for example the alpha-helical or beta-sheet structure; the charge or 
hydrophobics of the motecule at the target site; or the bulk of the side cham. The 
substitutions which in genera, are expected to produce the greatest changes . the 
polypeptide's properties are those in which (a) a hydrophiiic residue, e.g. sery. or thn*>ny. ,s 
Si fo (o by) a hydrophob, residue, e.g. .eucy,. M I-*-"* va*. or 
I^NbTa cysteine or proiine is substituted for (or by) any other residue; (c) a res,due 

eZ^r^ue. e.g. glufcmy. or aspartyl; or (d) a re*due hav^g a bu* «. cha,. 
e.g. phenylalanine, is substituted for (or by) one not having a side cham. e.g. glyane. 

II - *. —-»—•• —*» — * MrianB *• 

« L****** o, to CA ^ as needed. «MM . *» vaaan m be 
desSnededchmettebio^eeBvdyenheCAp^eio^te-ed. For«e,np* 
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Covalent modifications of CA polypeptides are included within the scope of this invention, for 
example for use in screening. One type of covalent modification includes reacting targeted 
amino acid residues of an CA polypeptide with an organic deriving agent that capab.e of 
reacting with selected side chains or the N-or C-termina. residues of an CA polypeptide. 
Derivation wKh fractional agents is useful, for instance, for crosslinking CA polypeptides 
to a waternnsolubte support matrix or surface for use in the method for purifying anu-CA 
antibodies or screening assays, as is more fully described below. Common* used 
crosslink^ agents include. e.g.. i.1-bis(diazoacety.)-2-pheny.ethane. glutaraldehyde. N- 
hydroxysuccinimide esters, for example, esters with 4-azidosa«icy.ic acid, homobrfunctiona, 
imidoesters. including disuccinimidy. esters such as 3.3^iti,iobis(suc^nimidylp ro pionate) 
bifunctional maleimides such as bls-N-ma.eimidc-1. Octane and agents such as methyl 
l(p-azidophenyl)dithio]propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl residues to the 
, corresponding glutamy. and asparty. residues, respectively, hydration of proline and 

lysine, phosphorylation of hydroxy, groups of seryl. threony. or tyrosy. residues, methylation of 
me a-amino groups of fysine. arginine. and histidine side chains [T.E. ™9 hto »J™™ 
Structure and Molecular Properties. W.H. Freeman & Co.. San Francisco, pp. 79-86 (1983)1. 
acetylation of the N-terminal amine, and amidation of any Oterminal carboxyl group. 

Another type of covalent modification of the CA poly peptide included within the scope of this 
invention comprises altering the native glycosylation pattern of the polypeptide. "Afteringthe 
native glycosylation pattern" is intended for purposes herein to mean deleting one or more 
carbohydrate moieties found in native sequence CA polypeptide, and/or adding one or more 
25 glycosylation sites that are not present in the native sequence CA polypeptide. 

Addition of gfycosylation sites to CA polypeptides may be accomplished by altering the amino 
acid sequence thereof. The alteration may be made, for example, by the addition of. or 
substitution by. one or more serine or threonine residues to the native sequence CA 
30 polypeptide (for Winked glycosylate sites). The CA amino acid sequence may optionaHy 
be altered through changes at the DNA level, particular* by mutating the DNA encod,ng the 
CA polypeptide at preselected bases such that codons are generated that will translate ,nto 
the desired amino acids. 

35 Another means of increasing the number of carbohydrate moieties on the CA polypeptide is 
by chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are 
described in the art. e.g.. in WO 87/05330 published 1 1 September 1987. and in Aplin and 
Wriston. LA Crit. Rev. Biochem.. pp. 259-306 (1981). 
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Removal of carbohydrate moieti s present on the CA polypeptide 
chemicany or enzymaticaHy or by mutational substitution of codons ^ *"» "* 
residues that serve as targets for gfycosylation. Chemical deflation te chn,aues are 
known in the art and described, for instance, by Hakimuddin. et a... Arch. B.ochem. B,ophys., 
259:52 (1987) and by Edge et a... Anal. Biochem.. 118.131 (1981). Enzyma* deavageof 
carbohydrate moieties on polypeptides can be achieved by the use of a vane^ f dc -and 
exogVcosidases as described by Thotakura et a... Meth. Enzymo... 138:350 (1987). 

Another type of covalent modification of CA comprises .inking the CA po.y peptide to one of a 
, variety of nonproteins^ polymers, e.g.. polyene ^^^T' 
povoxyalkylenes. in the manner set forth in U.S. Patent Nos. 4.640.835. 4.496.689. 
4.301.144; 4,670.417; 4.791.192 or 4.179.337. 

CA polypeptides of the present invention may also be modified in a way to form ch«neric 
s mol.es comprising an CA peptide fcsed to another, heterologous pc*^e « " — ■ 
acid sequence. In one embodiment, such a chimeric mo.ecu.e compnses a fus.cn of an CA 
polypeptide with a tag poiypeptide which provides an eptope to which an anWag antibody 
can slLe.y bind. The epitope teg is generally P .aced at the aminc-or carboxy.-term.nus of 
the CA polypeptide. a.though interna, fusions may a.so be tolerated in some instances. The 
presence of such epitope-tegged forms of an CA polypeptide can be detected us '"^ *™ 
antibody againstthe tag polypeptide. Also, provision of the epitope ^enab^theCA 
pofypeptid to be readi.y purified by affinity purification using an anti-tag antibody or an^er 
Canity ma ti* that b.r*s to toe epfcope tag. In an alternative em^ment, to~ 
mLb may comprise a fusion of an CA polypeptide with an immunoglobuhn or a particular 
region of an immunog.obu.in. For a bivalent form of the chimeric molecule, such a fusK,n 
could be to the Fc region of an IgG molecule. 

Vanoas teg OoVpepWec and nKpecdae ancnodtea ,re we» te^ to .he « copies 
Mu de polir-hie.idW (poiy-hte) or poiy-htetidtoe^cine B9 ltl™ Z e 

po^pepL and «, antibody 12CA5 t Fte» a, a,.. Mai. Cat BM. 8:2,59.2165 1965ft »a 
%Z 1 « 5F9. 3C7. 6E10. Gd. B7 and 9E.0 anTCodfca the** lE^a. * ,m«* 
aLceL-Bioteoy. S:3510*615<1985ft and *e Han*a S^«^^^ m D «°» 

<?*«<» •> *• 3(6,:647 " 553 (,M0 "' "! TL 

« . . i c*:*.***. w-192-194 (1992)1; tubulin epitope peptide 
KT3 eoitoDe peptide [Martin et al. , Science, 255. 1 9Z-i 

Z m. (Mtt - gen. 10 _ P«P«e 

J nau-frayannua, a, a,.. Proe Nad. Acad. Sci. USA. .7*393*397 ,1990ft 

Also included with tha definition el CA protein in ona embodiment are ether CA proteins af the 
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CA family, and CA proteins from other organisms, which are cloned and expressed as 
outlined below. Thus, probe or degenerate polymerase chain reaction (PCR) primer 
sequences may be used to find other related CA proteins from humans or other organisms. 
As will be appreciated by those in the art, particularly useful probe and/or PCR primer 
5 sequences include the unique areas of the CA nucleic acid sequence. As is generally known 
in the art. preferred PCR primers are from about 15 to about 35 nucleotides in length, wrth 
from about 20 to about 30 being preferred, and may contain inosine as needed. The 
conditions for the PCR reaction are well known in the art 

io in addition, as is outlined herein. CA proteins can be made that are longer than those 
encoded by the nucleic acids of the figures, for example, by the elucidation of add.t.onal 
sequences, the addition of epitope or purification tags, the addition of other fusion sequences, 
etc. 

15 CA proteins may also be identified as being encoded by CA nucleic acids. Thus. CA proteins 
are encoded by nucleic acids that will hybridize to the sequences of the sequence listings, or 
their complements, as outlined herein. 

In a preferred embodiment, the invention provides CA antibodies. In a preferred embodiment. 

20 when the CA protein is to be used to generate antibodies, for example for immunotherapy, the 
CA protein should share at least one epitope or determinant with the full length protem. By 
-epitope" or "determinant" herein is meant a portion of a protein which will generate and/or 
bind an antibody or T-cell receptor in the context of MHC. Thus. In most instances, antibodies 
made to a smaller CA protein will be able to bind to the full length protein. In a preferred 

25 embodiment, the epitope is unique; that is. antibodies generated to a unique epitope show 
little or no cross-reactivity. 

,n one embodiment, the term -antibody" includes antibody fragments, as are known in the art. 
including Fab. Fab,, single chain antibodies (Fv for example), chimeric antibodies, etc.. erther 
3 o produced by the modification of whole antibodies or those synthesfced de novo us,ng 
recombinant DNA technologies. 

Methods of preparing polyclonal antibodies are known to the skilled artisan. Polyclona. 
antibodies can be raised in a mammal, for example, by one or more injections of an 
35 immunizing agent and, rfdesired, an adjuvant Typical*, the immunfeing agent and/or 

adjuvant will be injected in the mammal by multiple subcutaneous or intraperitoneal .njections. 
The immunizing agent may include a protein encoded by a nucleic acid of the figures or 
fragment thereof or a fusion protein thereof. It may be useful to conjugate the immumzmg 
agent to a protein known fob immunogenic in the mammal being immunized. Examples of 
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such immunogenic proteins include but are not limKed to K yhoie .impet hemocyanin, serum 
albumin. boTne thyroglobulin. and soybean trypsin inhibitor. Examp.es of adjuvants wh,ch 
may b employed include Freund's complete adjuvant and MPL-TDM adjuvant 
(monophosphory. Lipid A. synthetic trehalose dicorynomycolate). The immunized protocol 
may be selected by one skilled in the art without undue experimental. 

Trie antibodies may. aKernattve,. be monoclona. an«bodfe, Monodona. angles may * 
prepared using hybridoma methods, such as those described by Kohler and mm. 
25*495 (1975). In a hybridoma method, a mouse, hamster, or other appropnate host ammal. 
> is typically immunfced wfth an immunWng agent to elicit lymphocytes that produce or are 
cap«b.e of produdng antibodies 

the oocytes may be immunfced In v*o. The immuring agar w „ 
include a po,peptide encoded by a nudeic add ofTab.es 1-112. orfragment thereof or a 
fusion proXreo, Genera,,. eHher periphera. btood ,mphocytes CPBLs ) are us* » 
5 cells of human origin are desired, or spleen cells or lymph node cells are used t non-hun* 
—an sources are desired. The ,mphocytes a. then fosed with an ce» 
„„e using a su«ab,e fusing agent such as po,ethy.ene glyco.. to form a ^ ^ 
coding. Monodona. Antibodies: Principles and Practice. Academ,c Press. (19*) p* 59- 
103] immortal** cel. lines are usual, transformed mammalian cells, part.cu.any myeloma 
, o cells of rodent, bovine and human origin. Usual,, rat or mouse ^efon« ceH ^ are 

employed. The hybridoma cells may be cultured in a suitab.e culture med,um that preferably 
contains one or more substances that inhibit the growth or survival of the unfused. 
immortalized cells. For example, if the parental cells lack the enzyme ^ X> || a "*'"^^ 
phosphoribosy. transferase (HGPRT or HPRT). the culture medium for the hy ndom* 
25 Epical, will indude hypoxanthine. aminopterin. and thymidine ("HAT med,um ). which 
substances prevent the growth of HGPRT-defident cells. 

,n one embodiment, the antibodies are bispecific antibodies. Bispedfic antibodies are 
monodona,, preferab, human or hummed, anybodies that have » 31 

30 leasttwodifferentantfcens. In the present case, one of the b,nd,ng specfiotes .s for a 
; o ilcodedbyaLeicaddofTab.esI^ 

preferably one that is tumor specific. 

US, ca mm. - ' 25% * "** 15 pretered - 
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preferred. 

,„ a prefeaad embodiment the ,n»bodtes to the OA am — ed .M-. 

o, otoe, am*-. tMn, eutKeouenoes of anbbodies, wtech oontem «mal «*aenoe 
aarfved torn non 1«" 1™""-***- a " Sb0< * S "**"", . . 

ZISr, o.m.radpiaa.^-ap^by^uea^aCDRa.anan.hu^spedas 

^asl^.Fva^^aaaa.^^^uno^N.a. 

capacity, in so Humanized antibodies may also compnse 

replaced by corresponding non.human residues. Humanizeo a 
residues which are found neither in the recipient antibody nor in the imported COR or 
^se.uences. in genera,, the humanfced antibody wU, compnse r- of at 

T tnn. ^tvoicallv two variable domains, in which all or substantially all of the CDR 

2Z1 rescues (FR) regions are those of a human immunoglobulin consensus sequence, 
framework residues ii-K.reg ™ mnrise at least a portion of an immunoglobulin 

The humanized antibody optimally also will comprise at least a pomon 

instant region (Fc) typically that of a human immunoglobulin Uones eta... Nature, 
constant region yc,. iy P y Cufr Qp 

321 :522_525 (1986); Riechmann et al.. Nature. 332.3ZJ_^» i »° /• 



Struct Biol.. 2:593_596 (1992)1 



»:u~4ioe ar»» well known in the art Generally, a 

Methods for humanizing non.human ant.bod.es are well known 

humanized antibody has one or more amino acid residues introduced into it from a source 
humanizeo anuuouy no& residU es are often referred to as import 

5 which is non.human. These non.human ammo aad res.dues are 0 

^^m*m**~^^"~Z; ™eTa Tature 
essentially performed following the method of Winter and coworkers [Jones et a... Nature. 
32V5* 52^ (1986); Riechmann et a... Nature. 332:323.327 (1988); Verhoeyen et a. 
^"23 - 534 536 (1988)]. by substituting rodent CDRs or CDR sequences forthe 
Science. 239.1534__iwo p» ;i. Accordingly such humanized antibodies are 

corresponding sequences of a human antibody. Accoroingiy. su« 

,o correspond, a ^ wher ein substantially less than an intact 

chimeric antibodies (U.S. Patent No. 4,816,567), wherein suds 
human variable domain has been substituted by the corresponding sequence from a 
H^L species in practice, humanfced antibodies are typically human antibodies in 
C^dues and possibty some FR residues are substituted by residues from 

3 5 analogous sites in rodent antibodies. 

Human embodies oan afco be adduced usiag »adous teohniaoes k™»n in the art. inching 

IHoogenbaam and WMr. , Mo,. «*. 227:3.1 
JSti 222:58, The techniques of Cole et ai and Boemer el al are alao 
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available for the preparation of human monoclona. antibodies [Cole et a... Monoclonal 
Antibodies and Cancer Therapy. /Man R. Uss. p. 77 (1985) and Boemer et at. J. Immunol.. 
147(1) 86 95 (1991)]. Similarly, human antibodies can be made by introducing human 
immunoglobulin tad into transgenic animals, e.g.. mice in which the endogenous 
immunog.obu.in genes have been partial* or complete* inactivated. Upon challenge, human 
antibody production is observed, which dose* resembles that seen in humans ,n ail respects, 
induding gene rearrangement assembly, and antibody repertoire. This ^approach 
described, for exampie. in U.S. Patent Nos. 5.545.807; 5.545.806; 5.569.825. 5.625.126. 
5 633 425- 5 661 .016. and in the following scientific publications: Marks et aL, 
, BioTTechndogy 10. 779.783 (1992); Lonbero et a... Nature 368 856.859 

Nature 368. 812 13 (1994); Fishwild et al.. Nature Biotechnology 14. 845.51 (1996); 

Neuberger. Natu're Biotechnology 14. 826 (1996); Lonberg and Huszar. Intern. Rev. Immunol. 

1365.93(1995). 

5 By immunotherapy » meant treatment of a carcinoma with an antibody raised against an C A 
pLein As used herein, immunotherapy can be passive or active. Passive monotherapy 
as defined herein is the passive transfer of antibody to a redpient (patient). Active 
immunization is me induction of antibody and/or T-cell responses in a redpient (patient) 
.nduction of an immune response is the result of providing the redpient with an antigen to 

, o whtah antibodies are raised. As appreciated by one of ordinary skid in the artthe > an^en 

may be provided by injecting a polypeptide against which antibodies are desued to be rased 
into a recipient, or contacting the recipient with a nudeic acid capable of expressing the 
antigen and under conditions for expression of the antigen. 

25 ,„ a preferred embodiment, oncogenes which encode secreted growth factors may be 

inhibited by raising antibodies against CA proteins that are secreted proteins as descnbed 
above. wLut being bound by theory, antibodies used for treatment, bind and prevent the 
secreted protein from binding to its receptor, thereby inactivating the secreted CA protem. 

30 in another preferred embodiment, the CA protein to which antibodies are raised is a 

LmemtLe protein. Wthout being bound by theory, antibodtes used for treatment b,nd 

Lext.cei.u.a^ 

as drcuiating Itaands or cell-associated molecules. The antibody may cause 
of the transmembrane CA protein. As wii. be appredated by one of ord.nary sk.ll ,n the art. 
3 5 the antibody may be a competitive, non-competitive or uncompetitive inhibitor of protem 

binding to the extrace..u.ar domain of the CA protein. The antibody is also an antagon* of 
the CA protein. Further, the antibody prevents activation of the transmembrane CA protem. 
,n one aspect, when the antibody prevents the binding of other molecules to the CA protein, 
in one asp ^ antibo dy may also sensitize the cell to cytotoxic 

the antibody prevents growth of the cell, ine anuoooy nw, «> w 
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agents inducing, but not limited to TNF-ct. W. IL-1 , INF-y and .L-2, or chemotherapeutic 
agents inciuding 5FU, vinblastine, actinomycin D. cisplatin. methotrexate, and the like. In 
some instances the antibody belongs to a sub-type that activates serum complement when 
complexed with the transmembrane protein thereby mediating cytotoxicity. Thus, carcinomas 
may be treated by administering to a patient antibodies directed against the transmembrane 
CA protein. 

in another preferred embodiment, the antibody is conjugated to a therapeutic moiety. In one 
aspect the therapeutic moiety is a small molecule that modulates the activity of the CA 
) protein. In another aspect the therapeutic moiety modulates the activity of molecules 

associated with or in close proximity to the CA protein. The therapeutic moiety may nhbt 
enzymatic activity such as protease or protein kinase activity associated with carcmoma. 

in a preferred embodiment, the therapeutic moiety may also be a cytotoxic agent In this 
5 method, targeting the cytotoxic agent to tumor tissue or cells, results in a reduction in the 
number of afflicted cells, thereby reducing symptoms associated with carcinomas, including 
lymphoma. Cytotoxic agents are numerous and varied and include, but are not limited to. 
cytotoxic drugs or toxins or active fragments of such toxins. Suitable toxins and thar 
corresponding fragments include diphtheria A chain, exotoxin A chain, ricin A cha.n. abnn A 
l0 chain, curcin. crotin. phenomycin. enomycin and the like. Cytotoxic agents also Include 

radiochemicals made by conjugating radioisotopes to antibodies raised against CA protons, 
or binding of a radionuclide to a chelating agent that has been covalently attached to the 
antibody. Targeting the therapeutic moiety to transmembrane CA protons not only serves to 
increase the local concentration of therapeutic moiety in the carcinoma of interest. ..e.. 
25 lymphoma, but also serves to reduce deleterious side effects that may be associated with the 
therapeutic moiety. 

,n another preferred embodiment, the CA protein against which the antibodies are raised is an 
intracellular protein. In this case, the antibody may be conjugated to a proton wh.ch 
3 o facilftates entry into the cel.. In one case, the antibody enters the cell ^ndc^tosis. in 

another embodiment, a nucleic acid encoding the antibody Is administered to the MM* or 
cell Moreover, wherein the CA protein can be targeted within a cell. i.e.. the nucleus, an 
antibody thereto contains a signal for that target location, i.e.. a nuclear localization signal. 

35 The CA antibodies of the invention specifically bind to CA proteins. By "specifically bind- 
herein is meant that the antibodies bind to the protein with a binding constant in the range of 
at least Iff 4 - Iff 6 M'\ with a preferred range being 10" 7 - 10 9 M' 1 . 

,n a preferred embodiment, the CA protein is purified or isolated after expression. CA 
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proteins may be isdated or purified in a variety of ways known to those sKiHed in the art 
depending on what other components are present in the sample. Standard punficabon 
methods include eiedrophoretic. mo.ecu.ar. immunotogica. and chromatography techn,^ 
including ion exchange, hydrophobe, affinity, and reverse-phase HPLC chromatography .and 
5 chromatofocusing. For examp.e. the CA protein may be purified using a 

antibody column. U.trafiltration and diafiitration techniques, in conjunction wrth rote.n 
conceltion. are a.so usefu,. For genera, guidance in suftabie purifica,on techn^ see 
Scopes. R. P Protein Purification. Springer-Veriag. NY (1982). The degree of 
necessary wfi. vary depending on the use of the CA protein. ,n some instances no punficat*n 

10 will be necessary. 

Once expressed and purified if necessary, the CA proteins and nucleic acids are usefu. in a 
number of applications. 

15 h one aapeot. tm axpreseion Mi of genes are de»»mined ftr afferent c*,a, -mm k •» 
seminoma pnenotype; that b, it-expression le«ls of genes to notmal ««»««'" 
c.^«^(^ln»om.o^for^«»^<'lW^^ re, « ,,> 
prognosis, as outlined below) are evaluated to provide expression profiles. An J^^^^ 
prol of a pardeuiar ce. e^ or poird of de^^ 

JLm. regam*, genes are airfare M 

^meacbofmesesafcaisobbdned. Then, diagnosis nay be dons „«Md does 
25 Luefmmapareeubrrpadentlwedreesneexpreesiw^ 

tissue. 

■ D «rerenba, exp-«eion.- or grammar awards as used derem. refb-a » both 
as area as quantitative dif^n=esinth.genesten^endfovoe»ular«p^npa««n,s 
3 „ and among fhe ce»s. Thus, a ddferendal* expressed gene oan dave da 

expression altered, including an activation or maoth-aiion. in. for example, nam* -ramus 

dsaue. Tba, is. gene, nte, be tented on ov brreed on » a p^ter 
„ anodrer elate. As . apparent ate «M arfiean. an, eompanson 
can be made. Sucn a„u*abve, regulated gene win e**«an axpressxrn paoern a 
35 state or 1 type «• is da**** by stands™ teobnigues In ovre aucb sb* orcd.lype.bu, 
is no, datectebte in both. Aiternaterety. me detemanabon is guanbtebve X M expresaon . 
^Ised or decreased; M h ma express o, me gene k ebner upregulated. res** . 
an kcreased amount of trensedpt. or deregulated, rearing in a decreased amount of 
Lscdpt Tbe deg.ee to wb«h express*, d*ere need on, be terge enough to ,uan«» »,a 



32 



PCTAJS01/51291 

WO 03/008583 

standard charact fixation techniques as outlined below, such as by use of Affymetrix 
GeneChip® expression arrays. Lockhari, Nature Biotechnology. 14:1675-1680 (1996). hereby 
expressly incorporated by reference. Other techniques include, but are not limited to. 
quantitative reverse transcriptase PGR. Northern analysis and RNase protect™. As outimed 
above, preferably the change in expression (i.e. upregu.ation or deregulation) is atleast 
about 500/.. more preferably at least about 100%. more preferably at least about 150%. more 
preferably, at least about 200-/.. with from 300 to at least 1000% being especial* preferred. 

As will be appreciated by those in the art. this may be done by evaluation at etoer the gene 
transcript, or the protein level; that is. the amount of gene expression may be monitored us,ng 
nucleic acid probes to the DNA or RNA equivalent of the gene transcript, and the 
quantification of gene expression levels, or. atomativefy. the final gene product itself (prnte-n) 
can be monitored, for example through the use of antibodies to the CA protein and standard 
immunoassays (ELISAs. etc.) or other techniques, including mass spectroscopy assays. 2D 
5 gel electrophoresis assays, etc. Thus, the proteins corresponding to CA genes. ,e. those 
identified as being important in a particular carcinoma phenotype. i.e.. lymphoma, can be 
•evaluated in a diagnostic test specific for that carcinoma. 

in a preferred embodiment, gene expression monitoring is done and a number of genes, i.e. 
o an expression profile, is montored simultaneously, although multiple protein 

mentoring can be done as wen. Similarly, these assays may be done on an ,nd,v,dual basis 



as well. 



,n this embodiment, the CA nucleic acid probes may be attached to biochips as outhned 
, 5 herein forthe detection and quantification of CAsequences in a particular cel.. The assays 
' are done as is known in the art As will be appreciated by those in the art. any number of 
different CA sequences may be used as probes, with single sequence assays be.ng used ,n 
some cases, and a plurality of the sequences described herein being used in other 
embodiments. In addition, white solid-phase assays are described, any number of solution 
3 o based assays may be done as well. 

,n a preferred embodiment, both solid and solution based assays may be used to detect CA 
sequences that are up-regu,ated or down-regulated in carcinomas as compared to normal 
tissue, in instances where the CA sequence has been atored but shows the same 
35 expression profile or an altered expression profile, the protein will be detected as outhned 



herein. 



,n a preferred embodiment nucleic adds encoding the CA protein are detected. Although 
DNA or RNA encoding the CA protein may be detected, of particular interest are methods 
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each one of multiple primary antibodies contains a distinct and detectable label. This method 
finds particular use in simultaneous screening for a plurality of CA proteins. As will be 
appreciated by one of ordinary skill in the art, numerous other histological imaging techniques 
are useful in the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the ability to detect 
and distinguish emissions of different wavelengths. In addition, a fluorescence activated cel. 
sorter (FACS) can be used in the method. 

in another preferred embodiment antibodies find use in diagnosing carcinomas from blood 
samples As previously described, certain CA proteins are secreted/circulating molecules. 
Blood samples, therefore, are useful as samples to be probed or tested for the presence of 
secreted CA proteins. Antibodies can be used to detect the CA proteins by any of the 
previously described immunoassay techniques including ELISA immunoblotting (Western 
blotting), immunoprecipitation. BIACORE technology and the like, as will be appreciated by 
one of ordinary skill in the art 

In a preferred embodiment, in situ hybridization of labeled CA nucleic acid probes to tissue 
arrays is done. For example, arrays of tissue samples, including CAtissue and/or normal 
tissue, are made. In situ hybridization as is known in the art can then be done. 

It is understood that when comparing the expression fingerprints between an individual and a 
standard, the skilled artisan can make a diagnosis as well as a prognosis. It is further 
understood that the genes which indicate the diagnosis may differ from those which indicate 
25 the prognosis. 

in a preferred embodiment, the CA proteins, antibodies, nucleic acids, modified proteins and 
cells containing CA sequences are used in prognosis assays. As above, gene express™ 
profiles can be generated that correlate to carcinoma, especially lymphoma, severity, in terms 
of long term prognosis. Again, this may be done on either a protein or gene level. w.th the 
use of genes being preferred. As above, the CA probes are attached to biochips for the 
detection and quantification of CA sequences in a tissue or patient The assays proceed as 
outlined for diagnosis. 

in a preferred embodiment any of the CA sequences as described herein are used in drug 
screening assays. The CA proteins, antibodies, nucleic acids, modified proteins and cells 
containing CA sequences are used in drug screening assays or by evaluating the effect of 
drug candidates on a "gene expression profile" or expression profile of polypeptides. In one 
embodiment the expression profiles are used, preferably in conjunction with high throughput 
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screening techniques to allow monitoring for expression profi. genes after treatment with a 
candidate agent. Zlokarnik. et a... Science 279. 84-8 (1998). Heid. t a... Genome Res.. 
6:986-994 (1996). 

,n a preferred embodiment, the CA proteins, antibodies, nuoieic acids, modified proteins and 
cel.sconta.ning the native or modified CA proteins are used in screening assays. That ,s. *e 
present invention provides novel methods for screening for compositions wtah modulate the 
carcinoma phenotype. As above, this can be done by screening for modulators of gene 
expression or for modulators of protein activrry. Similarly, this may be done on an md,v,dua. 
, gene or protein level or by evaluating the effect of drug candidates on a "gene express.cn 
profile' in a preferred embodiment, the expression profiles are used, preferably .n 
conjunction with high throughput screening techniques to al.ow monitoring for express.cn 
profile genes after treatment with a candidate agent, see Zlokarruk. supra. 

s Having identified the CA genes herein, a variety of assays to evaluate the effects of agents on 
gene expression may be executed. In a preferred embodiment, assays may be «n on an 
individual gene or protein level. That is. having identified a particular gene as aberrantly 
regulated in carcinoma, candidate bioactive agents may be screened to modulate the genes 
response. -Modulation" thus includes both an increase and a decrease in gene express.on or 
, o activity. The preferred amount of modutotion wii. depend on the original change of the gene 
expression in norma, versus tumor tissue, with changes of at least 10%. preferab* 50%. 
more preferably 100-300%. and in some embodiments 300-1000% or greater. Thus J ^a 
gene exhibits a 4 fold increase in tumor compared to norma, tissue, a decrease of about four 
fold is desired; a 10 fold decrease in tumor compared to normal tissue gives a 10 fold 
25 increase in expression for a candidate agent is desired, etc. Alternatively, where the CA 

LTueTce haTbeen altered but shows the same expression profi* or an altered express.cn 
profile, the protein will be detected as outfitted herein. 

As will be appreciated by these in the art, this ma, be done b» evaluate* a. a«h* the gene or 

proL and the quanta o, gene express mm. or. aPematere*. tee M d the £. 
product itself can be monitored, (Or example through the use of antibodies to the CA protein 
and stand,™ Immunoassays. A»ema«yel». binding and bieacWtyasaa^w* the protein 
may be done as outlined below. 

35 ma preferred embodiment, gene expression monitoring is done and a number of genes. i.e. 
an expression profile, is monitored simultaneously. aKhough multiple protein express.on 
monitoring can be done as well. 
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,n this embodiment, the CA nucleic acid probes are attached to biochips as outlined herein for 
the detection and quantification of CA sequences in a particular cel.. The assays are further 
described below. 

Generally, in a preferred embodiment, a candidate bioacth* agent is added to the cells prior 
to analysis. Moreover, screens are provided to identify a candidate bioacUve agent which 
modulates a particular type of carcinoma, modulates CA proteins, binds to a CA protein, or 
interferes between the binding of a CA protein and an antibody. 
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The term "candidate bioactive agent" or 'drug candidate" or grammatical equivalents as used 
herein describes any molecule, e.g.. protein, oligopeptide, small organic or inorganic 
molecule, polysaccharide, polynucleotide, etc.. to be tested for bioactive agents that are 
capable of directly or indirectly altering either the carcinoma phenotype. binding to and/or 
modulating the bioactivrty of an CA protein, or the expression of a CA sequence, including 
both nucleic acid sequences and protein sequences. In a particularly preferred embodiment 
the candidate agent suppresses a CA phenotype. for example to a normal tissue fingerpnnt. 
Similarly, the candidate agent preferably suppresses a severe CA phenotype. Generally a 
plurality of assay mixtures are run in parallel with different agent concentrations to obtain a 
differential response to the various concentrations. Typically, one of these concentrations 
serves as a negative control. t.e.. at zero concentration or below the level of detection. 

,noneas P ect.acandidateagentwi» neutralize the effect of an CA protein. By 'neutralize" is 
meant that activity of a protein is either inhibited or counter acted against so as to have 
substantially no effect on a cell. 

Candidate agents encompass numerous chemical classes, though typically they are organic 
or inorganic molecules, preferably small organic compounds having a molecular weight of 
more than 100 and less than about 2.500 daltons. Preferred small molecules are less than 
2000 orlessman1500or.essthan1000orlessthan500D. Candidate agents compnse 
functional groups necessary for structural interaction with proteins, particularly hydrogen 
bonding, and typically include at least an amine, carbonyl. hydroxy! or carboxy. group, 
preferably at least two of the functional chemical groups. The candidate agents often 
compnse cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic structures 
substituted with one or more of the above functional groups. Candidate agents are also found 
among biomolecules including peptides, saccharides, fatty acids, steroids, purines, 
pyridines, derivatives, structural anatogs or combinations thereof. Particularly preferred are 

peptides. 

agents are obtained from a wide variety of sources including libraries of synthetic 
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or natural compounds. For example, numerous means are available for random and directed 
synthesis of a wide variety of organic compounds and biomolecules. including expression of 
randomized oligonucleotides. Alternatively, libraries of natural compounds in the form of 
bacterial, fungal, plant and animal extracts are available or readily produced. Additionally. 
5 natural or synthetically produced libraries and compounds are readily modified through 
conventional chemical, physical and biochemical means. Known pharmacological agents 
may be subjected to directed or random chemical modifications, such as acylation. alkylafjon. 
esterification. amidification to produce structural analogs. 

10 In a preferred embodiment, the candidate bioactjve agents are proteins. By 'protein" herein is 
meant at least two covalently attached amino acids, which includes proteins, polypeptides, 
oligopeptides and peptides. The protein may be made up of naturally occurring amino acids 
and peptide bonds, or synthetic peptidomimerjc structures. Thus "amino acid", or "peptide 
residue", as used herein means both naturally occurring and synthetic amino acids. For 

15 example, homo-phenylalanine, citrulline and noreleucine are considered amino acids for the 
purposes of the invention. "Amino acid" also includes imino acid residues such as proline and 
hydroxyproline. The side chains may be in either the (R) or the (S) configuration. In the 
preferred embodiment the amino acids are in the (S) or L-configuration. If non-naturally 
occurring side chains are used, non-amino acid substituents may be used, for example to 

20 prevent or retard in vivo degradations. 

In a preferred embodiment, the candidate bioactive agents are naturallyoccurring proteins or 
fragments of naturally occurring proteins. Thus, for example, cellular extracts containing 
proteins, or random or directed digests of proteinaceous cellular extracts, may be used. In 
this way libraries of procaryotic and eucaryotic proteins may be made for screening in the 
25 methods of the invention. Particularly preferred in this embodiment are libraries of bacterial, 
fungal, viral, and mammalian proteins, with the latter being preferred, and human proteins 
being especially preferred. 
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In a preferred embodiment the candidate bioactive agents are peptides of from about 5 to 
about 30 amino acids, with from about 5 to about 20 amino acids being preferred, and from 
about 7 to about 15 being particularly preferred. The peptides may be digests of naturally 
occurring proteins as is outlined above, random peptides, or "biased" random peptides. By 
•randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide 
consists of essentially random nucleotides and amino acids, respectively. Since generally 
these random peptides (or nucleic acids, discussed below) are chemically synthesized, they 
may incorporate any nucleotide or amino acid at any position. The synthetic process can be 
designed to generate randomized proteins or nucleic acids, to allow the formation of all or 
most of the possible combinations over the length of the sequence, thus forming a library of 
randomized candidate bioactive proteinaceous agents. 
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In one embodiment, the library is fully randomized, with no sequence preferences or 
constants at any position. In a preferred embodiment, the library is biased. That is. some 
positions within the sequence are either held constant, or are selected from a limited number 

5 of possibilities. For example, in a preferred embodiment, the nucleotides or amino acid 
residues are randomized within a defined class, for example, of hydrophobic amino acids, 
hydrophilic residues, sterically biased (either small or large) residues, towards the creation of 
nucleic acid binding domains, the creation of cysteines, for cross-linking, prolines for SH-3 
domains, serines, threonines, tyrosines or histidines for phosphorylation sites, etc.. or to 

10 purines, etc. 

In a preferred embodiment, the candidate bioactive agents are nucleic acids, as defined 
above. 
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As described above generally for proteins, nucleic acid candidate bioactive agents may be 
naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. For 
example, digests of procaryotic or eucaryotic genomes may be used as is outlined above for 
proteins. 

In a preferred embodiment, the candidate bioactive agents are organic chemical moieties, a 
wide variety of which are available in the literature. 

in assays for altering the expression profile of one or more CA genes, after the candidate 
agent has been added and the cells allowed to incubate for some period of time, the sample 
containing the target sequences to be analyzed is added to the biochip. If required, the target 
sequence is prepared using known techniques. For example, the sample may be treated to 
lyse the cells, using known lysis buffers, electroporation. etc.. with purification and/or 
amplification such as PCR occurring as needed, as will be appreciated by those in the art. 
For example, an in vitm transcription with labels covalently attached to the nucleosides is 
done. Generally, the nucleic acids are labeled with a label as defined herein, with biotin-FITC 
or PE, cy3 and cy5 being particularly prefened. 

In a preferred embodiment, the target sequence is labeled with, for example, a fluorescent, 
chemiluminescent, chemical, or radioactive signal, to provide a means of detecting the target 
sequence's specific binding to a probe. The label also can be an enzyme, such as. alkaline 
phosphatase or horseradish peroxidase, which when provided with an appropriate substrate 
produces a product that can be detected. Alternatively, the label can be a labeled compound 
or small molecule, such as an enzyme inhibitor, that binds but is not catalyzed or altered by 
the enzyme. The label also can be a moiety or compound, such as. an epitope tag or b.obn 
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which specifically binds to streptavidin. For the example of biotin. the streptavidin is labeled 
as described above, thereby, providing a detectable signal for the bound target sequence. As 
known in the art. unbound labeled streptavidin is removed prior to analysis. 

As will be appreciated by those in the art. these assays can be direct hybridization assays or 
can comprise "sandwich assays", which include the use of multiple probes, as is generally 
outlined in U.S. Patent Nos. 5.681.702. 5.597.909. 5.545.730. 5.594.117. 5.591.584. 
5 571 670. 5.580.731. 5.571.670. 5.591.584. 5.624.802. 5.635.352. 5.594.118. 5.359.100. 
5 124 246 and 5.681.697. all of which are hereby incorporated by reference. In this 
embodiment, in general, the target nucleic acid is prepared as outlined above, and then 
added to me biochip comprising a plurality of nucleic acid probes, under conditions that allow 
the formation of a hybridization complex. 

A variety of hybridization conditions may be used in the present invention, including high, 
moderate and tow stringency conditions as outlined above. The assays are generally run 
under stringency conditions which allows formation of the label probe hybridization complex 
only in the presence of target Stringency can be controlled by altering a step parameter that 
is a thermodynamic variable, including, but not limited to. temperature, fbrmam.de 
concentration, salt concentration, cnaotropic salt concentration pH. organic solvent 
20 concentration, etc. 

These parameters may also be used to control non-specific binding, as is generally outlined in 
U.S. Patent No. 5.681.697. Thus it may be desirable to perform certain steps at h.gher 
stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways, as will be 
appreciated by those in the art Components of the reaction may be added simuitaneously. or 
sequentially, in any order, with preferred embodiments outlined below. In addition, the 
reaction may Include a variety of other reagents may be included in the assays. These 
include reagents like salts, buffers, neutral proteins, e.g. albumin, detergents, etc which may 
be used to facilitate optimal hybridization and detection, and/or reduce non-specrfic or 
background interactions. Also reagents that otherwise improve the efficiency of the assay, 
such as protease inhibitors, nuclease inhibitors, antimicrobial agents, etc.. may be used, 
depending on the sample preparation methods and purity of the target. In addition, either 
solid phase or solution based (i.e.. kinetic PCR) assays may be used. 

Once the assay is run. the data is analyzed to determine the expression levels, and changes 
in expression levels as between states, of individual genes, forming a gene expression profile. 
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In a preferred embodiment, as for the diagnosis and prognosis applications, having identified 
the differentially expressed gene(s) or mutated gene(s) important in any one state, screens 
can be run to alter the expression of the genes individually. That is, screening for modulation 
of regulation of expression of a single gene can be done. Thus, for example, particularly in 
5 the case of target genes whose presence or absence is unique between two states, screening 
is done for modulators of the target gene expression. 

In addition, screens can be done for novel genes that are induced in response to a candidate 
agent After identifying a candidate agent based upon its ability to suppress a CA expression 

10 pattern leading to a normal expression pattern, or modulate a single CA gene expression 
profile so as to mimic the expression of the gene from normal tissue, a screen as described 
above can be performed to identify genes that are specrfically modulated in response to the 
agent Comparing expression profiles between normal tissue and agent treated CA tissue 
reveals genes that are not expressed in normal tissue or CA tissue, but are expressed in 

is agent treated tissue. These agent specific sequences can be identified and used by any of 
the methods described herein for CA genes or proteins. In particular these sequences and 
the proteins they encode find use in marking or identifying agent treated cells. In addition, 
antibodies can be raised against the agent induced proteins and used to target novel 
therapeutics to the treated CA tissue sample. 
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Thus, in one embodiment, a candidate agent is administered to a population of CA cells, that 
thus has an associated CA expression profile. By "administration- or "contacting- herein is 
meant that the candidate agent is added to the cells in such a manner as to allow the agent to 
act upon the cell, whether by uptake and intracellular action, or by action at the cell surface. 
In some embodiments, nucleic acid encoding a proteinaceous candidate agent (i.e. a peptide) 
may be put into a viral construct such as a retroviral construct and added to the cell, such that 
expression of the peptide agent is accomplished; see PCT US97/01019. hereby expressly 
incorporated by reference. 

Once the candidate agent has been administered to the cells, the cells can be washed if 
desired and are allowed to incubate under preferably physiological conditions for some period 
of time. The cells are then harvested and a new gene expression profile is generated, as 
outlined herein. 

Thus, for example, CA tissue may be screened for agents that reduce or suppress the CA 
phenotype. A change in at least one gene of the expression profile indicates that the agent 
has an effect on CA activity. By defining such a signature for the CA phenotype, screens for 
new drugs that alter the phenotype can be devised. With this approach, the drug target need 
not be known and need not be r presented in the original expression screening platform, nor 
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does th level of transcript for the target protein need to change. 

in a preferred embodiment, as outlined above, screens may be done on individual genes and 
gene products (proteins). That is, having identified a particular differentially expressed gene 
as important in a particular state, screening of modulators of either the expression of the gene 
or the gene product itself can be done. The gene products of differentially expressed genes 
are sometimes referred to herein as "CA proteins" or an "CAP". The CAP may be a 
fragment, or alternatively, be the full length protein to the fragment encoded by the nuclac 
acids of Tables 1-112. Preferably, the CAP is a fragment In another embodiment the 
sequences are sequence variants as further described herein. 

Preferably, the CAP is a fragment of approximately 14 to 24 amino acids long. More 
preferably the fragment is a soluble fragment Preferably, the fragment includes a non- 
transmembrane region. In a preferred embodiment the fragment has an N-terminal Cys to 
aid in solubility. In one embodiment the c-terminus of the fragment is kept as a free acd and 
the n-terminus is a free amine to aid in coupling, i.e.. to cysteine. 

In one embodiment the CA proteins are conjugated to an Immunogenic agent as discussed 
herein. In one embodiment the CA protein is conjugated to BSA 

in a preferred embodiment screening is done to alter the biological function of the expression 
product of the CA gene. Again, having identified the importance of a gene in a particular 
state, screening for agents that bind and/or modulate the biological activity of the gene 
product can be run as is more fully outlined below. 



In a preferred embodiment screens are designed to first find candidate agents that can bud 
to CA proteins, and then these agents may be used in assays that evaluate the ability of the 
candidate agent to modulate the CAP activity and the carcinoma phenotype. Thus, as will be 
appreciated by those in the art there are a number of different assays which may be run; 
30 binding assays and activity assays. 

in a preferred embodiment, binding assays are done. In general, purified or isolated gene 
product is used; that Is. the gene products of one or more CA nucleic acids are made. In 
general, this is done as is known in the art For example, antibodies are generated to the 
35 protein gene products, and standard immunoassays are run to determine the amount of 
protein present Alternatively, cells comprising the CA proteins can be used in the assays. 

Thus in a prefened embodiment, the methods comprise combining a CA protein and a 
candidate bioacth/e agent, and determining the binding of the candidate agent to the CA 
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protein Preferred embodiments utilize th human or mouse CA protein, although other 
mammalian proteins may also be used, for example for the development of animal models of 
human disease. In some embodiments, as outlined herein, variant or derivative CA proteins 
may be used. 

Generally, in a preferred embodiment of the methods herein, the CA protein or the candidate 
agent is non-diffusably bound to an insoluble support having isolated sample receiving areas 
(e g. a microliter plate, an array, etc.). The insoluble supports may be made of any 
composition to which the compositions can be bound, is readily separated from soluble 
material and is otherwise compatible with the overall method of screening. The surface of 
such supports may be solid or porous and of any convenient shape. Examples of suitable 
insoluble supports include microliter plates, arrays, membranes and beads. These are 
typically made of glass, plastic (e.g.. polystyrene), polysaccharides, nylon or nitrocellulose. 
Teflon™*, etc. Microliter plates and arrays are especially convenient because a large number 
i of assays can be carried out simultaneously, using small amounts of reagents and samples. 
The particular manner of binding of the composition is not crucial so long as it is compatible 
with the reagents and overall methods of the invention, maintains the activity of the 
composition and is nondiffusable. Preferred methods of binding include the use of antibodies 
(which do not sterically block either the ligand binding site or activation sequence when the 
0 protein is bound to the support), direct binding to 'sticky" or ionic supports, chemical 

crosslinking. the synthesis of the protein or agent on the surface, etc. Following binding ofthe 
protein or agent, excess unbound material is removed by washing. The sample reoemng 
areas may then be blocked through incubation with bovine serum albumin (BSA). casein or 
other innocuous protein or other moiety. 

In a preferred embodiment, the CA protein is bound to the support, and a candidate bioactive 
agent is added to the assay. Alternatively, the candidate agent is bound to the support and 
the CA protein is added. Novel binding agents include specific antibodies, non.natural 
binding agents identified in screens of chemical libraries, peptide analogs, etc. Of particular 
J0 interest are screening assays for agents that have a low toxicity for human cells. A w,de 
variety of assays may be used for this purpose, including labeled in who protein _prote,n 
binding assays, electrophoretic mobility shift assays, immunoassays for protein binding, 
functional assays (phosphorylation assays, etc.) and the like. 

3 5 The determination of the binding of the candidate bioactive agent to the CA protein may be 
done in a number of ways. In a preferred embodiment, the candidate bioactive agent is 
labeled, and binding determined directly. For example, this may be done by attaching all or a 
portion of the CA protein to a solid support, adding a labeled candidate agent (for example a 
fluorescent label), washing off excess reagent, and determining whether the label is present 
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pport. Various blocking and washing steps may be utilized as is known in the 



By "labeled" herein is meant that the compound is either directly or indirectly labeled with a 
label which provides a detectable signal. e.g. radioisotope, fluoresces enzyme, antibod.es. 
particles such as magnetic particles, chemiluminescers. or specific binding molecules, etc. 
Specific binding molecules include pairs, such as biotin and streptavidin. digoxm and 
antidigoxin etc. For the specific binding members, the complementary member would 
normally be labeled with a molecule which provides for detection, in accordance w.th known 
procedures, as outlined above. The label can directly or indirectly provide a detectable s.gnal. 

in some embodiments, only one of the components is labeled. For example, the proteins (or 
proteinaceous candidate agents) may be labeled at tyrosine positions using I. or wth 
fluorophores. Alternatively, more than one component may be labeled with different labels; 
using 125 l for the proteins, for example, and a fluorophor for the candidate agents. 

in a preferred embodiment, the binding of the candidate bioactive agent is determined 
through the use of competitive binding assays. In this embodiment, the competrtor .s a 
binding moiety known to bind to the target molecule (i.e. CA protein), such as an anbbody. 
peptide, binding partner, ligand, etc. Under certain circumstances, there may be competfve 
binding as between the bioactive agent and the binding moiety, with the binding mo.ety 
displacing the bioactive agent. 

,n one embodiment, the candWate bioactive agent is labeled. Either the candidate bioactive 
agent or the competitor, or both, is added first to the protein for a time sufficient to allow 
binding, if present Incubations may be performed at any temperature which faahtates 
optimal activity, typically between 4 and 40'C. Incubation periods are selected for optimum 
activity but may also be optimized to facilitate rapid high through put screening. Typ.cally 
between 0. 1 and 1 hour will be sufficient Excess reagent is generally removed or washed 
away. The second component is then added, and the presence or absence of the labeled 
component is followed, to indicate binding. 

,n a preferred embodiment, the competitor added first, followed by the candkiate bioactive 
agent Displacement of the competitor is an indication that the candidate bioactive agent .s 
binding to the CA protein and thus is capable of binding to. and potentially modulabng. the 
activity of the CA protein. In this embodiment either component can be labeled. Thus, for 
example, if the competitor is labeled, the presence of label in the wash solution indicates 
displacement by the agent Alternatively, if the candidate bioactive agent is labeled, the 
presence of the label on the support indicates displacement 
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in an alternative embodiment, the candidat bioactive agent is added first, with incubation and 
washing, followed by the competitor. The absence of binding by the competitor may indicate 
that the bioactive agent is bound to the CA protein with a higher affinity. Thus, if the 
candidate bioactive agent is labeled, the presence of the label on the support, coupled with a 
lack of competitor binding, may indicate that the candidate agent is capable of binding to the 
CA protein. 

in a preferred embodiment, the methods comprise differential screening to identity bioactive 
agents that are capable of modulating the activity of the CA proteins. In this embodiment the 
methods comprise combining a CA protein and a competitor in a first sample. A second 
sample comprises a candidate bioactive agent, a CA protein and a competitor. The b.nd.ng of 
the competitor is determined for both samples, and a change, or difference in binding 
between the two samples indicates the presence of an agent capable of binding to the CA 
protein and potentially modulating its activity. That is. if the binding of the competitor .s 
different in the second sample relative to the first sample, the agent is capable of binding to 
the CA protein. 

Alternatively, a preferred embodiment utilizes differential screening to Identify drug candidates 
that bind to the native Ca' protein, but cannot bind to modified CA proteins. The structure of 
the CA protein may be modeled, and used in rational drug design to synthesize agents that 
interact with that site. Drug candidates that affect CA bioactivity are also identified by 
screening drugs for the ability to either enhance or reduce the activity of the protem. 
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Positive controls and negative controls may be used in the assays. Preferably all control and 
test samples are performed in at least triplicate to obtain statistically significant results, 
incubation of all samples is for a time sufficient for the binding of the agent to the protem. 
Following incubation, all samples aro washed free of non specifically bound materia, and the 
amount of bound, general labeled agent determined. For example, where a radiolabel • 
employed, the samples may be counted in a scintillation counter to determine the amount of 
bound compound. 

A variety of other reagents may be included in the screening assays. These include reagents 
like salts, neutral proteins. e.g. albumin, detergents, etc which may be used to fealrtate 
optima. protein_protein binding and/or reduce nonspecific or background interactions. Also 
reagents that otherwise improve the efficiency of the assay, such as protease inhibitors, 
nuclease inhibitors, antimicrobial agents, etc.. may be used. The mixture of components 
may be added in any order that provides for the requisite binding. 
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Screening for agents that modulate the activity of CA proteins may also be done. In a 
preferred embodiment, methods for screening for a bioactive agent capable of modulating the 
activity of CA proteins comprise the steps of adding a candidate bioactive agent to a sample 
of CA proteins, as above, and determining an alteration in the biological activity of CA 
proteins. 'Modulating the activity of an CA protein" includes an increase in activity, a 
decrease in activity, or a change in the type or kind of activity present Thus, in this 
embodiment the candidate agent should both bind to CA proteins (although this may not be 
necessary), and alter its biological or biochemical activity as defined herein. The methods 
include both in vitro screening methods, as are generally outlined above, and in vivo 
screening of cells for alterations in the presence, distribution, activity or amount of CA 
proteins. 

Thus, in this embodiment the methods comprise combining a CA sample and a candidate 
bioactive agent and evaluating the effect on CA activity. By "CA activity- or grammatical 
equivalents herein is meant one of the CA protein's biological activities, including, but not 
limited to. its role in tumorigenesis. including cell division, preferably in lymphatic tissue, cell 
proliferation, tumor growth and transformation of cells. In one embodiment CA activity 
includes activation of or by a protein encoded by a nucleic acid of Tables 1-112. An inhibitor 
of CA activity is the inhibition of any one or more CA activities. 

■ * • - * 

In a preferred embodiment the activity of the CA protein is increased; in another preferred 
embodiment, the activity of the CA protein is decreased. Thus, bioactive agents that are 
antagonists are preferred in some embodiments, and bioactive agents that are agonists may 
be preferred in other embodiments. 
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In a preferred embodiment the invention provides methods for screening for bioactive agents 
capable of modulating the activity of a CA protein. The methods comprise adding a candidate 
bioactive agent as defined above, to a cell comprising CA proteins. Preferred cell types 
Include almost any cell. The cells contain a recombinant nucleic acid that encodes a CA 
protein. In a preferred embodiment a library of candidate agents are tested on a plurality of 
cells. 

In one aspect the assays are evaluated in the presence or absence or previous or 
subsequent exposure of physiological signals, for example hormones, antibodies, peptides, 
antigens, cytokines, growth factors, action potentials, pharmacological agents including 
chemotherapeutics, radiation, carcinogenics. or other cells (i.e. cell-cell contacts). In another 
example, the determinations are determined at different stages of the cell cycle process. 

In this way, bioactive agents are identified. Compounds with pharmacological activity are 
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able to enhance or interfere with the activity of the CA protein. 

,n one embodiment, a method of inhibiting carcinoma cancer cel. division, is provided. The 
method comprises administration of a carcinoma cancer inhibitor. 

in a preferred embodiment, a method of inhibiting lymphoma carcinoma cel. division is 
provided comprising administration of a lymphoma carcinoma inhibitor. 

in another embodiment, a method of inhibiting tumor growth is provided. The method 
10 comprises administration of a carcinoma cancer inhibrtor. In a particularly preferred 

embodiment, a method of inhibiting tumor growth in lymphatic tissue is provded compnsmg 
administration of a lymphoma inhibitor. 

,n a further embodiment, methods of treating cells or individuals wrth cancer are provided. 
The method comprises administration of a carcinoma cancer inhibrtor. Preferab*. the 
xs carcinoma is a lymphoma carcinoma. 

,n one embodiment a carcinoma cancer inhibrtor is an antibody as discussed above, .n 
another embodiment the carcinoma cancer inhibrtor is an antisense molecule. Antisense 
molecules as used herein include antisense or sense oligonucleotides compnsmg a singe- 
stranded nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA 
(sense) or DNA (antisense) sequences for carcinoma cancer molecules. Antisense or sen** 
oligonucleotides, according to the present invention, comprise a fragment general* at feast 
al 14 nucfeotides. preferably from about 14 to 30 nucleotides. The ability to 
antisense or a sense oligonucleotide, based upon a cDNA sequence encoding a given protein 
• s dLbed in. for example. Stein and Cohen. Cancer Res. 48:2659. (1988) and van der Kro. 
et aL, BioTechniques 6:958, (1988). 

A*e„ee artecaie, «y be ftbodeoed «° a <- —"9 •» ^"^^JT" 
b, tanaboa of a ooa»ate w» a agaad biadiag faoiecufe. as descabed ,n WO M« 
Seifable *aad Kadff* fr*co*s lacfade. Ma* no. Mtf * «« — «"* 

r^ad awn, rao^c* .Ob* . ft cc^poadia, a^eoefe o, aeaep*.. »~°- 
taa^asaaraa^^aa^acascoaiasatad^^a^ca,. A^*.a 

se ^„anan fe ^*aauc^da m a»ba«odu=ad«».ca« c^laaaa, ^ 

nucleic acid seaueace by foaaa»oa of aa ol*oaadaa«da-r,pid donate, as descabed . WO 
90,10448 Kisuad^matmaaaaofantisaaaea.otecalaa.xkaookootaadla^.a 

^fna, aiso be ased k M assays as discussed above, ft add«*a . .ft*, o. 
treatment. 
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The compounds having the desired pharmacological activity may be administered in a 
physiologically acceptable carrier to a host, as previously described. The agents may be 
administered in a variety of ways, orally, parenteral^ e.g.. subcutaneously. intraperitoneally. 
intravasculariy. etc. Depending upon the manner of introduction, the compounds may be 
formulated in a variety of ways. The concentration of therapeutically active compound in the 
formulation may vary from about 0.1_100% wgWol. The agents may be administered alone 
or in combination with other treatments. i.e.. radiation. 

The pharmaceutical compositions can be prepared in various forms, such as granules, 
tablets, pills, suppositories, capsules, suspensions, salves, lotions and the like. 
Pharmaceutical grade organic or inorganic carriers and/or diluents suitable for oral and topical 
use can be used to make up compositions containing the therapeutically.active compounds. 
Diluents known to the art include aqueous media, vegetable and animal oils and fats. 
Stabilizing agents, wetting and emulsifying agents, salts for varying the osmotic pressure or 
buffers for securing an adequate pH value, and skin penetration enhancers can be used as 
auxiliary agents. 

Without being bound by theory, it appears that the various CA sequences are important in 
carcinomas. Accordingly, disorders based on mutant or variant CA genes may be 
determined. In one embodiment the invention provides methods for identifying cells 
containing variant CA genes comprising determining all or part of the sequence of at least one 
endogenous CA genes in a cell. As will be appreciated by those in the art. this may be done 
using any number of sequencing techniques. In a preferred embodiment, the invention 
provides methods of identifying the CA genotype of an Individual comprising determining all or 
part of the sequence of at least one CA gene of the individual. This is generally done in at 
least one tissue of the individual, and may include the evaluation of a number of tissues or 
different samples of the same tissue. The method may include comparing the sequence of 
the sequenced CA gene to a known CA gene. i.e.. a wild-type gene. As will be appreciated 
by those in the art. alterations in the sequence of some oncogenes can be an indication of 
either the presence of the disease, or propensity to develop the disease, or prognosis 
evaluations. 

The sequence of all or part of the CA gene can then be compared to the sequence of a 
known CA gene to determine if any differences exist. This can be done using any number of 
known homology programs, such as Bestfit etc. In a preferred embodiment the presence of 
a difference in the sequence between the CA gene of the patient and the known CA gene is 
indicative of a disease state or a pr pensity for a disease state, as outlined herein. 
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In a preferred embodiment, the CA genes are used as probes to determine the number of 
copies of the CA gene in the genome. For example, some cancers exhibit chromosomal 
deletions or insertions, resulting in an alteration in the copy number of a gene: 

In another preferred embodiment CA genes are used as probes to determine the 
chromosomal location of the CA genes. Information such as chromosomal location finds use 
in providing a diagnosis or prognosis in particular when chromosomal abnormalities such as 
translocations, and the like are identified in CA gene loci. 

o Thus, in one embodiment methods of modulating CA in cells or organisms are provided. In 
one embodiment, the methods comprise administering to a cell an anti-CA antibody that 
reduces or eliminates the biological activity of an endogenous CA protein. Alternatively, the 
methods comprise administering to a cell or organism a recombinant nucleic acid encoding a 
CA protein. As will be appreciated by those in the art, this may be accomplished in any 

.5 number of ways. In a preferred embodiment for example when the CA sequence is down- 
regulated in carcinoma, the activity of the CA gene is increased by increasing the amount of 
CA in the cell, for example by overexpressing the endogenous CA or by administering a gene 
encoding the CA sequence, using known gene-therapy techniques, for example. In a 
preferred embodiment the gene therapy techniques include the incorporation of the 

2 o exogenous gene using enhanced homologous recombination (EHR), for example as 

described in PCT/US93/03868, hereby incorporated by reference in its entirety. Alternatively, 
for example when the CA sequence is up-regulated in carcinoma, the activity of the 
endogenous CA gene is decreased, for example by the administration of a CA antisense 
nucleic acid. 

25 

In one embodiment the CA proteins of the present invention may be used to generate 
polyclonal and monoclonal antibodies to CA proteins, which are useful as described herein. 
Similarly, the CA proteins can be coupled, using standard technology, to affinity 
chromatography columns. These columns may then be used to purify CA antibodies. In a 

30 preferred embodiment the antibodies are generated to epitopes unique to a CA protein; that 
Is, the antibodies show little or no cross-reactivity to other proteins. These antibodies find use 
in a number of applications. For example, the CA antibodies may be coupled to standard 
affinity chromatography columns and used to purify CA proteins. The antibodies may also be 
used as blocking polypeptides, as outlined above, since they will specifically bind to the CA 

35 protein. 

In one embodiment a therapeutically effective dose of a CA or modulator thereof is 
administered to a patient. By "therapeutically effective dose" herein is meant a dose that 
produces the effects for which it is administered. The exact dose will depend on the purpose 
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of the treatment, and will be ascertainable by one skilled in the art using known techniques. 
As is known in the art. adjustments for CA degradation, systemic versus localized delivery, 
and rate of new protease synthesis, as well as the age. body weight, general health, sex. diet, 
time of administration, drug interaction and the severity of the condition may be necessary, 
and will be ascertainable with routine experimentation by those skilled in the art 

A "patient" for the purposes of the present invention includes both humans and other animals, 
particularly mammals, and organisms. Thus the methods are applicable to both human 
therapy and veterinary applications. In the preferred embodiment the patient is a mammal, 
and in the most preferred embodiment the patient is human. 

The administration of the CA proteins and modulators of the present invention can be done in 
a variety of ways as discussed above, including, but not limited to. orally, subcutaneously. 
intravenously, intranasal*, transdermal*, intraperitoneal*, intramuscularly, intrapulmonary. 
i vaginally, rectally. or intraocularly. In some instances, for example, in the treatment of 
wounds and inflammation, the CA proteins and modulators may be directly applied as a 
solution or spray. 

The pharmaceutical compositions of the present invention comprise a CA protein in a form 

o suitable for administration to a patient In the preferred embodiment the pharmaceutical 
compositions are in a water soluble form, such as being present as pharmaceutically 
acceptable salts, which is meant to include both acid and base addition salts. 
"Pharmaceutically acceptable acid addition salt" refers to those salts that retain the biological 
effectiveness of the free bases and that are not biologically or otherwise undesirable, formed 

, 5 with inorganic acids such as hydrochloric acid, hydrobromic acid, sulfuric acid, nitric aod. 
phosphoric acid and the like, and organic acids such as acetic acid, propionic acid, glycol* 
acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic acid, fumaric acid, tartanc 
acid citric acid, benzoic acid, cinnamic acid, mandelic acid, methanesulfonic acid, 
ethanesulfonic acid, p toluenesulfonic acid, salicylic acid and the like. -Pharmaceutically 

30 acceptable base addition salts" include those derived from inorganic bases such as sodmrn. 
potassium, lithium, ammonium, calcium, magnesium, iron. zinc, copper, manganese, 
aluminum salts and the like. Particularly preferred are the ammonium, potassium, sodmrn. 
calcium and magnesium salts. Salts derived from pharmaceutically acceptable organic 
nontoxic bases include salts of primary, secondary, and tertiary amines, substituted amines 

35 including naturally occurring substituted amines, cyclic amines and basic ion exchange mm. 
such as isopropylamine. trimethylamine. diethylamine. triethylamine. tripropylarmne. and 
ethanolamine. 

The pharmaceutical compositions may also include one or more of the following: carrier 
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proteins such as serum albumin; buffers; fillers such as micro-crystalline cellulose, lactose, 
com and other starches; binding agents; sweeteners and other flavoring agents; coloring 
agents; and polyethylene glycol. Additives are well known in the art, and are used in a variety 
of formulations. 

5 In a preferred embodiment, CA proteins and modulators are administered as therapeutic 

agents, and can be formulated as outlined above. Similarly, CA genes (including both the full- 
length sequence, partial sequences, or regulatory sequences of the CA coding regions) can 
be administered in gene therapy applications, as is known in the art. These CA genes can 
include antisense applications, either as gene therapy (i.e. for incorporation into the genome) 

10 or as antisense compositions, as will be appreciated by those in the art 

In a preferred embodiment. CA genes are administered as DNA vaccines, either single genes 
or combinations of CA genes. Naked DNA vaccines are generally known in the art. Brower. 
Nature Biotechnology. 16:1304-1305 (1998). 

In one embodiment. CA genes of the present invention are used as DNA vaccines. Methods 
for the use of genes as DNA vaccines are well known to one of ordinary skill in the art. and 
include placing a CA gene or portion of a CA gene under the control of a promoter for 
expression in a patient with carcinoma. The CA gene used for DNA vaccines can encode full- 
20 length CA proteins, but more preferably encodes portions of the CA proteins including 

peptides derived from the CA protein. In a preferred embodiment a patient is immunized with 
a DNA vaccine comprising a plurality of nucleotide sequences derived from a CA gene. 
Similarly, it is possible to immunize a patient with a plurality of CA genes or portions thereof 
as defined herein. Without being bound by theory, expression of the polypeptide encoded by 
25 the DNA vaccine, cytotoxic T-cells, helper T-cells and antibodies are induced which recognize 
and destroy or eliminate cells expressing CA proteins. 

In a preferred embodiment, the DNA vaccines include a gene encoding an adjuvant molecule 
with the DNA vaccine. Such adjuvant molecules include cytokines that increase the 
30 immunogenic response to the CA polypeptide encoded by the DNA vaccine. Additional or 
alternative adjuvants are known to those of ordinary skill in the art and find use in the 
invention. 

In another preferred embodiment CA genes find use in generating animal models of 
35 carcinomas, particularly lymphoma carcinomas. As is appreciated by one of ordinary skill in 
the art. when the CA gene identified is repressed or diminished in CA tissue, gene therapy 
technology wherein antisense RNA directed to the CA gene will also diminish or repress 
expression of th gene. An animal generated as such serves as an animal model of CA that 
finds use in screening bioactive drug candidates. Similarly, gene knockout technology, for 
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example as a result of homologous recombination with an appropriate gene targeting vector, 
will result in the absence of the CA protein. When desired, tissue-specific expression or 
knockout of the CA protein may be necessary. 

5 It is also possible that the CA protein is overexpressed in carcinoma. As such, transgenic 
animals can be generated that overexpress the CA protein. Depending on the desired 
expression level, promoters of various strengths can be employed to express the transgene. 
Also, the number of copies of the integrated transgene can be determined and compared for 
a determination of the expression level of the transgene. Animals generated by such 

io methods find use as animal models of CA and are additionally useful in screening for 
bioactive molecules to treat carcinoma. 

The CA nucleic acid sequences of the invention are depicted in Tables 1-112. The 
■ sequences in Tables 1 and 2 depict mouse tags, i.e. the genomic insertion sites. The 
is sequences in Tables 3-102 include genomic sequence, mRNA and coding sequences for 
both mouse and human. N/A indicates a gene that has been identified, but for which there 
has not been a name ascribed. The different sequences are assigned the following SEQ ID 
Nos: 

2 o Table 3 (mouse gene: Fscnl ; human gene SNL) 

Mouse genomic sequence (SEQ ID NO: 1) 

Mouse mRNA sequence (SEQ ID NO: 2) 

Mouse coding sequence (SEQ ID NO: 3) 

Human genomic sequence (SEQ ID NO: 4) 
25 Human mRNA sequence (SEQ ID NO: 5) 

Human coding sequence (SEQ ID NO: 6) 

Table 4 (mouse gene Map3k6; human gene MAP3K6) 
Mouse genomic sequence (SEQ ID NO: 7) 
3 o Mouse mRNA sequence (SEQ ID NO: 8) 
Mouse coding sequence (SEQ ID NO: 9) 
Human genomic sequence (SEQ ID NO: 10) 
Human mRNA sequence (SEQ ID NO: 11) 
Human coding sequence (SEQ ID NO: 12) 
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Table 5 (mouse gene Fosb; human gene FOSB) 
Mouse genomic sequence (SEQ ID NO: 13) 
Mouse mRNA sequence (SEQ ID NO: 14) 
Mouse coding sequence (SEQ ID NO: 15) 
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CLAIMS 

We claim: 

5 1 . A recombinant nucleic acid comprising a nucleotide sequence selected from the 
group consisting of the sequences outlined in Tables 1-112. 

2. A host cell comprising the recombinant nucleic acid of claim 1 . 



10 3. 
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An expression vector comprising the recombinant nucleic acid according to claim 2. 
A host cell comprising the expression vector of claim 3. 



5 A recombinant protein comprising an amino acid sequence encoded by a nucleic acid 
15 sequence comprising a sequence selected from the group consisting of the sequences 

outlined in Tables 1-112. 

6 A method of screening drug candidates comprising: 

a) providing a cell that expresses a carcinoma associated (CA) gene comprising a 
nucleic acid sequence selected from the group consisting of the sequences outlined in Tables 

1 -1 1 2 or fragment thereof; 

b) adding a drug candidate to said cell; and 

c) determining the effect of said drug candidate on the expression of said CA gene. 

25 7 A method according to claim 6 wherein said determining comprises comparing the 
level of expression in the absence of said drug candidate to the level of expression in the 
presence of said drug candidate. 

8 A method of screening for a bioactive agent capable of binding to an CA protein 
30 (CAP) wherein said CAP is encoded by a nucleic acid comprising a nucleic acid sequence 

selected from the group consisting of the sequences outlined in Tables 1-112. said method 
comprising: 

a) combining said CAP and a candidate bioactive agent; and 

b) determining the binding of said candidate agent to said CAP. 

9 A method for screening for a bioactive agent capable of modulating the activity of an 
CA protein (CAP), wherein said CAP is encoded by a nucleic acid comprising a nucleic acid 
sequence selected from the group consisting of the sequences outlined in Tables 1-112. said 
method comprising: 
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a) combining said CAP and a candidate bioactive agent and 

b) determining the effect of said candidate agent on the bioactivity of sa,d 
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CAP. 



5 10. A method of evaluating the effect of a candidate carcinoma drug comprising: 

a) administering said drug to a patient; 

b) removing a cell sample from said patient; and 

c) determining alterations in the expression or activation of a gene compnsmg , a 
nucleic add sequence selected from the group consisting of the sequences outi-ned ,n Tables 

io 1-112. 

11 A method of diagnosing carcinoma comprising: 

s^nce setecW from ft. group consl^ of the sequences ouUmedmTaMes1-112,lna 

first tissue type of a first individual; and 

comparing said expression of said gene(s) from a second norma, tissue type from 

said first individual or a second unaffected individual; 

wherein a deference In said expression indicates that the first individua. has carcnoma. 

12 A method for inhibiting the activity of a CA protein (CAP), wherein said CAP is 
encoded by a nucleic acid comprising a nudeic add sequence selected from the group 
consisting of the sequences outiined in Tabtes 1-112. said method compns.ng b,nd,g an 
inhibitor to said CAP. 

13 A method of treating carcinomas comprising administering to a patient an inhibitor of 
an CA protein (CAP), wherein said CAP is encoded by a nudeic add comprisln; , a 
acid sequence seleded from the group consisting of the sequences outhned n Tables 1-112. 

14 A method of neutralteing the effect of an CA protein (CAP), wherein said CAP is 
encoded by a nudeic acid comprising a nucleic add sequence seiected from the group 
Listing of the sequences outlined in Tables 1-1 12. comprising contacting an agent speafic 
rSSJ Protein «, said CAP protein In an amount sufficient to effect neufralfcatio, 

15 A polypeptide which specifically binds to a protein encoded by a nudeic acid 
comprising a nucleic acid seleded from the group consisting, of the sequences outhned ,n 



3s Tables 1-112. 



16 A polypeptide according to claim 15 comprising an antibody which specific^ binds 
to a protein encoded by a nucielc add comprising a nucleic acid sequence selected from the 
group consisting of the sequences outlined in Tables 1-112. 
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17 A biochip comprising one or more nucleic acid segments selected from the group 
consisting of a nucleic acid of the sequences outlined in Tables 1-1 1 2 or fragments thereof. 

18. A method of diagnosing carcinoma or a propensity to carcinoma by sequencing at 
least one CA gene of an individual. 

19. A method of determining CA gene copy number comprising adding an CA gene probe 
to a sample of genomic DNA from an individual under conditions suitable for hybridization. 
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